Gene Cag_1237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1237 
Symbol 
ID3748270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1643795 
End bp1644805 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content52% 
IMG OID637773770 
Productarginine/ornithine transport system ATPase 
Protein accessionYP_379541 
Protein GI78189203 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1703] Putative periplasmic protein kinase ArgK and related GTPases of G3E family 
TIGRFAM ID[TIGR00750] LAO/AO transport system ATPase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCACC ACCATACATT TGATGTAGAA GCCATAGCCA ATGCCATTAT GCAAGGCAAT 
CGCCACCAGC TTAGTCGAGC AATTACGTTG GTGGAGTCGC AACGCATAGA GCACCACCAT
GTAGCTGAAG CTATTCTTGA GCGTTGTATG GCAAGCAATC GCCATGCGCT ACGTATTGGT
ATTACGGGTT CGCCCGGTGC GGGCAAAAGC ACCTTTATTG AGGCTTTTGG CGAACATATT
CTTTCGCAAG GATTAAGGCT TGCGGTGCTG GCAATTGACC CAAGTAGCCA CCATTCCAAA
GGAAGTATTC TTGGCGATAA AGCACGAATG GAAAAGCTTT CGGGACGAAA AGAGGCGTTT
ATTCGTCCAA CGCCTTCATC GGGGCATCTT GGCGGCACTT CACCCCGAAC GCACGAGGCG
CTATTGCTGT GCGAGGCGGC TGGTTATGAC GTAATAATTG TGGAGACGGT GGGTGTTGGG
CAGTCGGAGC TGCACATTGA GCAGATGGTA GATTTTGTGC TGCTTTTAAT GCTGCCCGGT
TCGGGCGATG AGCTGCAAGG CATTAAGCGA GGAATTATGG AAATTGCCGA TATGATTGCC
ATCACCAAAT GCGATGGTTT GCAAGCCACC AGCGCGGCTA TTTCTCATGC AGAATTTGAA
GCGGCGCTGC GCATGGTGCC AAAGCGCCAC CCCTTTTGGC AGCCAAGCGT GCAACTTACC
TCGGCGGTTA CGGGTGTGGG CATTGCTGAG GTGTGGCAGC AAATTGAACG TTTTTTTGCT
ATCATGCAGC AAGAGAATAG TTTAGAGACT CAGCGGCGTG AGCAACGGCG CCATTTGTTG
GCAAATGTGC TGGAAGAGCA ACTCCGCCGC CTCTTTTTTA ACCACCCCAC AATTCGTCAG
CAGCAACCCC ATCTTGTGCA GCAAGTGCTT GATGGCACGC TTAGCCCATT TACCGCCGCC
ACACGCCTTA TTGAGCTGTT TCGCCACAAT CCAATAGGAG AGAAACAGTA G
 
Protein sequence
MPHHHTFDVE AIANAIMQGN RHQLSRAITL VESQRIEHHH VAEAILERCM ASNRHALRIG 
ITGSPGAGKS TFIEAFGEHI LSQGLRLAVL AIDPSSHHSK GSILGDKARM EKLSGRKEAF
IRPTPSSGHL GGTSPRTHEA LLLCEAAGYD VIIVETVGVG QSELHIEQMV DFVLLLMLPG
SGDELQGIKR GIMEIADMIA ITKCDGLQAT SAAISHAEFE AALRMVPKRH PFWQPSVQLT
SAVTGVGIAE VWQQIERFFA IMQQENSLET QRREQRRHLL ANVLEEQLRR LFFNHPTIRQ
QQPHLVQQVL DGTLSPFTAA TRLIELFRHN PIGEKQ