Gene Cagg_0894 details


Gene Information

Locus tag: Cagg_0894
Symbol:
ID: 7267966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1122374
End bp: 1124296
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 60%
IMG OID: 643565742
Product: heavy metal translocating P-type ATPase
Protein accession: YP_002462249
Protein GI: 219847816
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2217] Cation transport ATPase
TIGRFAM ID: [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
            [TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
            [TIGR01525] heavy metal translocating P-type ATPase
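
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper name `feature_lengths` is hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
START_BP = 1122374
END_BP = 1124296


def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = feature_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 1923 640
```

Under these assumptions the result agrees with the listed Gene Length (1923 bp) and Protein Length (640 aa).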


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000637035
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCACCA TTTCTAATAC TGCACCCCGG CAGCGCCTGA CGCTGCTGCT CAACGACGAA 
ATCCTCGAGC CGGCCTTTGT CGCTTTGACC CTGATCGGTA TCGTCACCGG TCTGATACTG
GAAGGATCGG GTGCGCCGGA GTCGATCATC TTGGTCGTCC ATCTGGCTAC CTACTTTTTT
GGCGGTTTTT ATGCAGTGCG GGCCATCATC GAGGCCTTAC GCCATTGGTC GATTGAAGTT
GACCTGTTGA TGGTATTGGC AGCGCTCGGT GCGGGCTATT TAGGCGATTT TACCGAAGGT
GCGATTCTGC TCTTTCTCTT TTCGTTGAGC AATGTGTTGC AAGCCTATGC AATGCGGCGT
ACCGAACAGG CGATTACCGC GCTGATGCAG TTGCGCCCGG ATACGGTGAC GGTTCATCGC
GATGGGCGTG AACTCGATCT GCCGATTGAG GCGGTGCAAG TGGGCGATGT GATAGTGCTT
CGCCCCGGTG ACCGGGTGCC GCTCGACGGT GTGATCGAAC GGGGGAGCGG TTCGTTTGAC
GAATCGGCGT TGACCGGCGA GTCGATGCCG GTGCAGAAGG GGCCGGGGAT GGCGGTGTTG
GCCGGTACGC TTAACCAGAC TGGCGCGCTG GAAGTGCGGG TAACCAAGCC GGCCAGTGAG
AGTACGTTGG CCCGGATTAT TACGATGGTG AGCGAAGCGC AGGCGCGTAA GGCGCGGTCG
CAGAGCTTTC TTGAATATTT TGAGCAGCGG TATGCAATTG GCGTAATCGT TGCGGTGATT
TTGTTCATCC TTGCCGTACC GGCGCTAACC GGAGCCGACT TTGCCGATAC CTTCTACCGC
GGAATGGTGC TGCTCACAGT CGCTTCGCCG TGCGCGCTCG TGATCAGTGT ACCGGCTTCG
TTACTAAGCG CGATTGCAGC CGGGGCGCGG CGTGGGGTGC TGTTCAAGGG TGGCGTGCAT
CTTGAGGAAT TGAGCAAGGT ACGGGTGATC GCTTTCGACA AAACCGGCAC GTTGACCTTT
GGTAAGCCGA CAATGACCGA TCTCGTGCCG ATGAATGGGG TGGACGAAGC CGATCTATTG
GCGATTGTGG CCCGCGCCGA GCAACCTTCA GAGCACCCGA TTGCGCGTGC CATTTTGCAA
GCCGCCGAAG AACGTGGGAT CACGGTTGCG CCACCCGAGC AGTTTACGGC TGTGACCGGG
ATGGGTGTGC GTGCGATGTG GGAAGGGGTT GAGACACTGG TCGGTTCGCC GCGCTTGTTT
GCCGAGGCCG GGGTGGTTAT GCCGTCGGAG TTGTCGGCGC GGGCCGATGA GCTAATGGCG
CAAGGGCGCG GGAGTGTGTT GTTCGTTCGG CGTGGCGAGC AGTGGTTGGG ATTGGTAGCG
GTGATGGATC GTGAACGGCC CGATGCAGCC CAGCGCATTG CCGAGTTGCG CGCTGCCGGT
ATCGAGCGGA TCGTGATGCT GACCGGCGAT AATCCGCAGG TGGCGGAAGC GATGGCGCGC
CGGCTGGGTG TGGATGAGGT GCATGCCGGC CTGTTGCCCG CCGATAAGCT GCGTATCGTC
GAGCAGTTAC GCCAGCGTTA CGGTGGTGTG GCGATGGTCG GCGACGGTGT GAATGACGCA
CCGGCGTTGG CGGCGGCGAC GGTGGGAATT GCGATGGGGG CTGCCGGTAC CGATGCGGCA
CTCGAGACGG CCGATCTGGT GTTGATGCGC GATGATTTGA GTGCGATTAC TTACGCACTG
CGGCTCAGCC GCCGCACCCA GCGCGTGGTC TGGCAAAATA TTATCTTTGC GCTGGCGGTT
GTGGTAGTGT TGGTGACAAC AACATTGACG GTGGGTGTAC CGTTGCCACT CGGTGTGGTC
GGGCACGAAG GCAGCACGAT TATTGTGGTG CTCAACGGGT TACGGCTATT GATGTTCCGC
TGA
 
Protein sequence
MTTISNTAPR QRLTLLLNDE ILEPAFVALT LIGIVTGLIL EGSGAPESII LVVHLATYFF 
GGFYAVRAII EALRHWSIEV DLLMVLAALG AGYLGDFTEG AILLFLFSLS NVLQAYAMRR
TEQAITALMQ LRPDTVTVHR DGRELDLPIE AVQVGDVIVL RPGDRVPLDG VIERGSGSFD
ESALTGESMP VQKGPGMAVL AGTLNQTGAL EVRVTKPASE STLARIITMV SEAQARKARS
QSFLEYFEQR YAIGVIVAVI LFILAVPALT GADFADTFYR GMVLLTVASP CALVISVPAS
LLSAIAAGAR RGVLFKGGVH LEELSKVRVI AFDKTGTLTF GKPTMTDLVP MNGVDEADLL
AIVARAEQPS EHPIARAILQ AAEERGITVA PPEQFTAVTG MGVRAMWEGV ETLVGSPRLF
AEAGVVMPSE LSARADELMA QGRGSVLFVR RGEQWLGLVA VMDRERPDAA QRIAELRAAG
IERIVMLTGD NPQVAEAMAR RLGVDEVHAG LLPADKLRIV EQLRQRYGGV AMVGDGVNDA
PALAAATVGI AMGAAGTDAA LETADLVLMR DDLSAITYAL RLSRRTQRVV WQNIIFALAV
VVVLVTTTLT VGVPLPLGVV GHEGSTIIVV LNGLRLLMFR
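
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not part of the original record: `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text):
    """Remove all whitespace from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACCACCA TTTCTAATAC TGCACCCCGG CAGCGCCTGA CGCTGCTGCT CAACGACGAA"
seq = join_blocks(first_line)
print(len(seq), seq[:3])  # 60 ATG
```

Applying `join_blocks` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare against the GC content field in the record.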