Gene Cag_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1600 
Symbol 
ID3746675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2090597 
End bp2091880 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content38% 
IMG OID637774140 
ProductATPase 
Protein accessionYP_379898 
Protein GI78189560 
COG category[R] General function prediction only 
COG ID[COG3950] Predicted ATP-binding protein involved in virulence 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0426773 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTG AGCATTTGAT AGTAAAAAAT TTCAAAGGCT TTGTGTCTAA AGAATTTACG 
TTTCATCCCA ACTTCAATTT GATTGTTGGC ATGAATGGCA CAGGTAAAAC AAGTATGCTT
GATGCCCTTG CGGTTGCTAT TGGAAGCTGG TTTTTAGGAT TTTATGTTGA TTCGCTGAAA
ATGCGTCAAA TTCGACACGA TGATGTTTTA TTAAAATATA TACAGCATAG TTGGGAACAT
ATTTACCCTT GTGAAGTTGA AGCTTATGGG GTTGTTATGG ATCGTCATAT CAAATGGAGT
CGTGAATTAA ATACTATAAA TGGTCGTACT ACTTATGGAA ATGCATTAGC CATAAAAGAG
TTGGCATTAC AGGCAACTCG CTCTATGCTG AATGGGGATG ACATTATTTT ACCACTGATT
TCTTATTATG GTACGGGACG TTTATGGCAG GAACCACGAG AGGCATTCAA GGTATCTGAT
CCTCGGAAAG TTGCTAATAA GGAAACTCAA TCGCGCAGAA CAGGTTATTT TAATAGTATT
GAACCTCGTT TGTCGGTTAA CCAACTCACT CAATGGATTG CTCAGCAATC GTGGATTGCT
TATCAAGAGC AAGGTCAAGT TTTTCCTGTA TTTAATACAG TACAAGATGC AATAATTGGC
TGTATTGAAG ATGCTAAAAA ACTTTATTTT GATGCTAAGC TTGGTGAAGT AATTGTTGAA
TTTTCATCAG GAACACAACC ATTTTCAAAT TTAAGTGATG GGCAGCGCTG TATGTTGGCA
ATGGTTGGTG ATATTGCACA TAAAGCAGCT AAACTTAATC CACATCTTGG TAGTGATGTG
CTGAAGGAAA CAAATGGTGT GGTGCTGATT GACGAGCTTG ATTTACACCT TCATCCTCGT
TGGCAACGAC GAGTTATTGA GGATTTACGA AACGTATTTC CAAAAATTCA GTTTATTTGT
ACCACACACT CTCCGTTCTT AATTCAATCG TTACGGAGTG GCGAGGAATT AGTGATGCTT
GATGGTCAGC CATTTGCAAC TCTTGGTAAT TTATCCTTAG AAGAGATTGC TCATGGTATT
CAACAAGTGA AAAACCCTGA AGTAAGTTTA CGCTATGAAA GTATGAAAGC AACGGCTAAA
AGTTTTCTTA CAATGCTTGA TGAAGCTTCA TTAGCACCAA AAGAAAAACT GAAACAATTA
GCCGATAAAC TTCGCCCTTA TGCTGATAAT CCAGCATTTC AGGCCTTTCT TGAAATGGAA
CGTATAGCCA AATTAGGAGA GTAA
 
Protein sequence
MRIEHLIVKN FKGFVSKEFT FHPNFNLIVG MNGTGKTSML DALAVAIGSW FLGFYVDSLK 
MRQIRHDDVL LKYIQHSWEH IYPCEVEAYG VVMDRHIKWS RELNTINGRT TYGNALAIKE
LALQATRSML NGDDIILPLI SYYGTGRLWQ EPREAFKVSD PRKVANKETQ SRRTGYFNSI
EPRLSVNQLT QWIAQQSWIA YQEQGQVFPV FNTVQDAIIG CIEDAKKLYF DAKLGEVIVE
FSSGTQPFSN LSDGQRCMLA MVGDIAHKAA KLNPHLGSDV LKETNGVVLI DELDLHLHPR
WQRRVIEDLR NVFPKIQFIC TTHSPFLIQS LRSGEELVML DGQPFATLGN LSLEEIAHGI
QQVKNPEVSL RYESMKATAK SFLTMLDEAS LAPKEKLKQL ADKLRPYADN PAFQAFLEME
RIAKLGE