Gene Cag_1339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1339 
Symbol 
ID3746854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1806943 
End bp1808520 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content44% 
IMG OID637773877 
Productputative sugar transport protein 
Protein accessionYP_379642 
Protein GI78189304 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.922881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACGTA ATGTTGTAAA TGCTGAGCAA ACGGAAGAGG TTAGCAGCAC TCGGCGGGTT 
ATTGCTGCAT CCTCCGTTGG CACGCTCATT GAGTGGTACG ACTTCTACAT TTTTGGAAGT
CTTGCCAAAA TTATTTCCGA ACAATTTTTC CCAAAAGACA ATCCAACAGC AGCCTTGCTG
GCAACGCTTG CTACTTTTGC GGCTGGCTTT GTGGTACGCC CCTTTGGTGC CCTCTTTTTT
GGTCGCTTGG GCGATCTTAT CGGAAGAAAG TACACCTTTC TTGTTACGCT GGTTATTATG
GGTGGCTCAA CCTTTGCAAT TGGTTTAGTA CCTGGTTACG CAACTATTGG TTTTGCTGCG
CCTGCAATTG TGTTTGTGCT GCGCTTACTG CAAGGTTTAG CGCTGGGCGG TGAGTATGGC
GGTGCGGCAA CCTATGTGGC GGAGCACTCT CCAAACGGTA AACGTGGTTT TTGGACAAGC
TTTATTCAAA CCACAGCAAC CTTTGGTCTC TTTCTGTCGC TTGGCGTTAT TTTAATTGTT
CGCCAAACGC TTGGTGTTGA AACCTTTCAA GATTGGGGCT GGCGCGTACC ATTTATTCTT
TCTGCATTTT TAGTTGGCGT TTCAATTTAC ATCCGCATGA AAATGTCGGA ATCGCCAATG
TTTGCTAAAA TGAAGAAAGA GGGCAAAACC TCAGCTAATC CACTTGCCGA AAGCTTTAAG
CAAAAGGATA ACCTGAAAAT GGTGCTGCTT GCTTTGCTTG GTGCTACGGC TGGTCAAGGT
GTGGTTTGGT ACACAGGTCA ATTCTATGCT CTTTCATTTT TGCAAAACGC TTGCAACATT
GAGTTTGAGC AAAGCAACTT GATTATTCTT ATTGCACTTG TTATTGGCAC CCCATTCTTT
GTGATTTTTG GTGCGCTCTC CGACAAAATT GGTCGTAAGT ACATTATGAT GGCTGGTATG
TTTATTGCCG TGCTTGCTTA TCGTCCTATT TACACCATGA TGTACAACGA TGCCAATCTC
AAAAATAAAA TTGAGATTGT TGACCAAACC ACCGTTGAAA CCAAAGAAGA GGTAAAAGGC
ACCGACAACG TTATTACCAC CGTAACGAAA AAAACTTTTG AGGATGGTAC CACTTACAAA
GAAATCAAAA AAGAGACCAT CCCGCTTGAT GCTGCAAAAA AAGCTGAACT TGCTGCTGCC
GACAAGCTAA AGCCTGAAAC CAAAAAAGAG GTAGTTCTGC CACAGCACTT GTACTACAAA
ATGATTGGTT TAGTGCTAAT TCAGGTGATT TTTGTTACCA TGGTGTATGG TCCAATTGCA
GCATTCCTTG TTGAAATTTT CCCAACACGC ATTCGCTACA CCTCCATGTC GCTCCCTTAC
CACATTGGTA ACGGTGTATT TGGTGGTTTA GTACCGCTGA TTTCAACCCG TCTTGTAGAA
GCAACCCGTC CTGCTGCTGG CTTACCTCCA GCCGATCCGC TTGCTGGCTT GTGGTATCCA
ATTATTATTG CTGGCGTAAG CTTTGTTATT GGTATGCTTT ACATTTCAAA CAACACCAAC
AACATGGACG TTGAGTAA
 
Protein sequence
MARNVVNAEQ TEEVSSTRRV IAASSVGTLI EWYDFYIFGS LAKIISEQFF PKDNPTAALL 
ATLATFAAGF VVRPFGALFF GRLGDLIGRK YTFLVTLVIM GGSTFAIGLV PGYATIGFAA
PAIVFVLRLL QGLALGGEYG GAATYVAEHS PNGKRGFWTS FIQTTATFGL FLSLGVILIV
RQTLGVETFQ DWGWRVPFIL SAFLVGVSIY IRMKMSESPM FAKMKKEGKT SANPLAESFK
QKDNLKMVLL ALLGATAGQG VVWYTGQFYA LSFLQNACNI EFEQSNLIIL IALVIGTPFF
VIFGALSDKI GRKYIMMAGM FIAVLAYRPI YTMMYNDANL KNKIEIVDQT TVETKEEVKG
TDNVITTVTK KTFEDGTTYK EIKKETIPLD AAKKAELAAA DKLKPETKKE VVLPQHLYYK
MIGLVLIQVI FVTMVYGPIA AFLVEIFPTR IRYTSMSLPY HIGNGVFGGL VPLISTRLVE
ATRPAAGLPP ADPLAGLWYP IIIAGVSFVI GMLYISNNTN NMDVE