Gene Cag_0889 details

Gene Information

Locus tag: Cag_0889
Symbol:
ID: 3747350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1219728
End bp: 1221041
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 47%
IMG OID: 637773420
Product: transporter, putative
Protein accession: YP_379197
Protein GI: 78188859
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
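
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1219728
end_bp = 1221041

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1314 437"
```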


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAGCA ACAATTCCTA CAAACTTGGA CCTATTACTT TAGCCCCATC GGTGCTACCT 
CGCCATGCGT TAACCTACCT TTATGCCGCA TTTTTTTCCA TTGGCTTAGT AACCTTTGTT
TCCATTGGGC AAACCTACAT TCTTAACGAG CACCTTAAAA TTCCTACCTC ACAGCAGGGT
GCTATTAGTG GCGATTTAGT TTTTTGGACG GAAGTTGTAA CCCTGCTCTT TTTTGTGCCA
GCAGGCATGT TGATGGATCG CATTGGGCGT AAACCTGTTT ATAGCGCGGG ATTTTTATTA
GTTGCTTTGT CGTATGCGCT CTATCCACTG TCGCGCTCCA TTGAGGAGAT GACCATTTAC
CGTATGATTT ACGCGCTTGG CATTGTAGCT TTAACCAGCG CGCTTTCAAC GGTAATGATT
GATTATGCGG CAGAGCGTTC GCGTGGTAAG CTCATTGCAA TTACGGGTTT TCTTAACGGT
ATTGGTATTG TTGTTATTAA CAGCTTTTTT GGTGGATTAC CGCAAAAGCT TATGGCGCAA
GGTTTTAGTG GGATTGAAGC GGGGCTTTAC ACCCATTTTG GTATTGCGGC TATTGCCGTA
GTTGCGGCAG TTGTGGTGGG ATTGGGATTA AAAGGTGGCA CCGAAGTTCG TAAAGAGGAT
CGCCCTCCAC TTCGCTCCCT TTTTACCAGT GGTATTAAGT GTGCAAAAAA TCCTCGTATT
TTGCTCTCCT ATGCCGCCGC CTTTGTAGCG CGTGGCGACC AATCCATTAT TGGAACTTTT
GTGCCGCTAT GGGGTACCAC CACCGGTATT GCCCTTGGCA TGGAACCCGC TGAGGCCGTT
AAGCAAGGAA TGATGATGTT TATTATTTCG CAAGCCGCTG CACTGCTTTG GGCTCCCGTT
ATTGGACCGC TTATTGATCG CTGGAACCGC GTTACGGCAC TCTTTGTTTG CATGGCACTT
GCCAGCGTTG GCTACCTTTC ACTTGGCTTT ATTGGTAATC CTCATGATGC CAATGCCTAC
ATTTTTTTCA TTCTTCTTGG CATTGGGCAA ATTAGCTCCT TCCTTGGCGC TCAATCGTTG
ATTGGGCAAG AAGCGCCAAA AGCTGAGCGT GGTTCAGTGG TTGGCATGTT CAACATCAGC
GGCGCTATTG GCATTCTTAT TATAACCACA CTTGGAGGGC GCTTGTTTGA TAGTTGGAGC
CCCAAAGCCC CTTTTCTTGT AGTAGGTGCT ATTAATGTGC TTGTAATGCT TGCCGCCATT
TACGTGCGTA TAAAAGCACC CGGGAAAAAT CTGCATGTTG CCGAAGAGGG GTAA
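
The gene sequence is displayed in space-separated groups of 10 bases, so it needs to be joined before any length or composition check. The following is a minimal sketch; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool, and the example runs on only the first two groups of the sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence, as formatted on this page.
fragment = clean_sequence("ATGAGTAGCA ACAATTCCTA")
print(len(fragment), gc_percent(fragment))  # prints "20 35.0"
```

Running the same helpers on the full sequence block gives a value that can be compared with the gene length and GC content listed in the Gene Information section.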
 
Protein sequence
MSSNNSYKLG PITLAPSVLP RHALTYLYAA FFSIGLVTFV SIGQTYILNE HLKIPTSQQG 
AISGDLVFWT EVVTLLFFVP AGMLMDRIGR KPVYSAGFLL VALSYALYPL SRSIEEMTIY
RMIYALGIVA LTSALSTVMI DYAAERSRGK LIAITGFLNG IGIVVINSFF GGLPQKLMAQ
GFSGIEAGLY THFGIAAIAV VAAVVVGLGL KGGTEVRKED RPPLRSLFTS GIKCAKNPRI
LLSYAAAFVA RGDQSIIGTF VPLWGTTTGI ALGMEPAEAV KQGMMMFIIS QAAALLWAPV
IGPLIDRWNR VTALFVCMAL ASVGYLSLGF IGNPHDANAY IFFILLGIGQ ISSFLGAQSL
IGQEAPKAER GSVVGMFNIS GAIGILIITT LGGRLFDSWS PKAPFLVVGA INVLVMLAAI
YVRIKAPGKN LHVAEEG
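
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two tables differ in which codons may act as start codons); `translate` is a hypothetical helper written for this illustration, and unknown codons are shown as `X`.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA block (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted on this page.
print(translate("ATGAGTAGCA ACAATTCCTA CAAACTTGGA"))  # prints "MSSNNSYKLG"
```

The output matches the first 10 residues of the protein sequence; the full gene sequence can be passed in the same way and compared against the complete protein listing.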