Gene Cagg_1155 details

Gene Information

Locus tag: Cagg_1155
Symbol:
ID: 7267904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1423860
End bp: 1425077
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 59%
IMG OID: 643565999
Product: Citrate transporter
Protein accession: YP_002462501
Protein GI: 219848068
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases
TIGRFAM ID:
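
The coordinate and length fields above are mutually consistent, and the following minimal sketch shows the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. Neither convention is stated in the record, and the helper names `span_length` and `coding_codons` are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1423860
end_bp = 1425077
gene_length_bp = 1218
protein_length_aa = 405


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, ASSUMING one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


# 1425077 - 1423860 + 1 = 1218 bp; 1218 / 3 - 1 = 405 aa
print(span_length(start_bp, end_bp), coding_codons(gene_length_bp))
```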


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.52181
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000561595
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACACCA TCATCTCAGT ACTGACGTTG CTGATCGTCG CCGCGACCAT CTTTGGGGTC 
GCGGTCGGGC GCTGGCCATT GCTACGCGCC GACCGTACTA CGATTACCCT GATCGGTGCT
GCACTGCTGC TTGGCATCGG CGCGATGTCG TTGGAAGAAG CGTATGCGGC GATTGATTTC
GACACCATCC TGTTGCTGTT TAGCATGATG GTGATCAACG GCAGTCTCTT TCTGAGCGGT
TGTTTTGGGG TTATTACCCA GCGTGTAGTG CAATTCGCCC GTGGCCCGCG CTCATTGTTG
GCGTTGGTCA TCGGCGCGAG TGGCGTGCTG TCGGTGCTGT TCCTCAACGA TACAATTGTG
CTCATGATGA CGCCGATCGT GCTCGATGTC ACTCGCGCCC TACGGCGCAA TCCGTTGCCT
TACCTGATCG GGTTGGCAGT CGCGGCCAAC ATCGGCTCGA CCGCTACCAT CACCGGTAAT
CCGCAGAACA TTATCATCGG TAGTGCCTCG AAGATTCCCT ATCTCGACTT CGCCGCTGCG
CTAACGCCAA CTGCGTTGAT CGGCCTCGTC ATCTGCTGGG TGATCGTCAT GCTGATCTAC
CGTGACGAAT TTCGGAGCGG AGCATTGGTG GCACCCAACG TCTTACGCAC GCGGGTCTAC
CGACCATTGT TACGCAAAGC CGGTGTTGTG ATCGTCCTGA TGCTGATCGC GTTTCTGGTT
GGCGTACCGG TCCCGCTTGC AGCGTTTGTC GCTGCCGGTG CGTTACTCGC TACCCGCCGT
TTTCGGCCCG AACGGGTCTA CAAGACCATT GACTGGGGCC TCTTGACCTT CTTTGCCGGG
CTATTCGTCG TCACTCACGC GCTAGAGACG CAGGGTTGGA CCGAACAGCT CTTTGCTGCG
CTGGCCCCTT TAGCGCAAGC CGGCATGGTG CCGTTTGGCG TGGTGTCGGT CGTCCTGTCA
AACGTAATCA GTAATGTACC GGCGGTGTTA TTGTTGCAGA ACGTGATCCC GGCCTTTGCC
GATCAGCAGC GGGCCTGGCT AACGCTGGCC GCGACCGCCA CGTTGGCCGG TAATTTGACC
TTACTCGGTT CGGTGGCGAA CCTGATTATG GCCGAGCTGG CGGCGCGCTG GGGGGTACGG
GTGAGCTTCG GCGCGTACCT GAAGGTAGGC CTGCCGGTGA CGATCTTGAC CGTGGCGGTG
AGCTTGGTGC TGGTATGA
 
Protein sequence
MDTIISVLTL LIVAATIFGV AVGRWPLLRA DRTTITLIGA ALLLGIGAMS LEEAYAAIDF 
DTILLLFSMM VINGSLFLSG CFGVITQRVV QFARGPRSLL ALVIGASGVL SVLFLNDTIV
LMMTPIVLDV TRALRRNPLP YLIGLAVAAN IGSTATITGN PQNIIIGSAS KIPYLDFAAA
LTPTALIGLV ICWVIVMLIY RDEFRSGALV APNVLRTRVY RPLLRKAGVV IVLMLIAFLV
GVPVPLAAFV AAGALLATRR FRPERVYKTI DWGLLTFFAG LFVVTHALET QGWTEQLFAA
LAPLAQAGMV PFGVVSVVLS NVISNVPAVL LLQNVIPAFA DQQRAWLTLA ATATLAGNLT
LLGSVANLIM AELAARWGVR VSFGAYLKVG LPVTILTVAV SLVLV
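
The sequences above are laid out in space-separated blocks of ten residues, so they need cleaning before use. The sketch below is one possible way to strip that layout, compute a GC fraction, and translate the coding sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, on the assumption that translation table 11 shares its codon-to-amino-acid assignments and differs only in permitted start codons. Only the first line of the gene sequence is reproduced here for illustration.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        residue = AMINO[16 * a + 4 * b + c]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = ("ATGGACACCA TCATCTCAGT ACTGACGTTG "
              "CTGATCGTCG CCGCGACCAT CTTTGGGGTC")
dna = clean_sequence(first_line)
print(len(dna), translate(dna))
```

Applied to the full gene sequence, the same helpers could be used to compare the result against the listed protein sequence and GC content.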