Gene Arth_4207 details


Gene Information

Locus tag: Arth_4207
Symbol:
ID: 4443608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008539
Strand:
Start bp: 38990
End bp: 40942
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 69%
IMG OID: 639687732
Product: heavy metal translocating P-type ATPase
Protein accession: YP_829429
Protein GI: 116662376
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2217] Cation transport ATPase
TIGRFAM ID: [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
    [TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
    [TIGR01525] heavy metal translocating P-type ATPase
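
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(38990, 40942)
print(length)                     # 1953
print(protein_length_aa(length))  # 650
```

Both values agree with the Gene Length and Protein Length fields of this record.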


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGACG CGTGCGGATG CGGCGATGAT AAGCCGGAAA CAGAAGAAGG CCAGGAAGAA 
GCCGAGGGCT TCTGGCAGGT GACCGAGGTC CGCGCGGCCG CGGTCTCCGG AGTCCTGCTG
CTGGTGGCGT GGATTGCTTC GCTGCTCGAC GCATCCGAGT GGGTCACCCT GCCGCTGGAG
GCCGCGGCGC TGCTGGTCGC AGCGTGGACG TTCGTTCCCT CCACCCTGCG CCGGCTGGTC
AAGGGCAAGA TCGGTGTCGG CACGCTGATG ACGATCGCGG CCGTGGGCGC CGTCATCCTG
GGCCAGATCG AGGAAGCGGC GATGCTGGCC TTCCTCTATG CCATCTCCGA GGGCCTGGAA
GAATACTCGG TGGCACGCAC CCGGCGCGGG CTGCGGGCGC TGCTGGACCT GGTCCCGTCC
GAGGCCAGGG TGCTGCGCAA CGGAACCGAA GTTACCGTCT CCCCGTCCGA GCTGGTGCTG
GGTGAGACCA TGGTCGTTCG CCCGGGCGAG CGTCTTGCCA CCGACGGACG CGTCCTGACC
GGGCGCACCT CCCTGGACAC CTCAGCCCTG ACCGGCGAAT CAGTCCCTGT CGAGGCCGGA
CCCGGCAGCG AAGTCTACGC GGGGTCGATC AACGGCACCG GTCCGCTCGA GGTCCAGGTC
ACCAGCACCG CGGAGAACAA CTCCCTGGCC CGGATCGTGC ACATAGTTGA GGCCGAGCAG
TCCCGCAAGG GCCCCGGCCA GCGCCTGGCC GACTCCATCG CCAAAAAGCT TGTCCCCGGG
ATCCTGATCG CCGCCGCGCT CATTGCCGTC TTCGGTTTCA TCGTGGGCGA ACCGGTCCTG
TGGATCGAAC GCGCCCTCGT TGTCCTGGTG GCAGCCTCGC CGTGTGCCCT TGCCATCTCC
GTGCCCGTGA CCGTGGTCGC CGCCGTCGGC GCGGCCAGCC GGATGGGCGT CCTGATCAAG
GGCGGCGGCG CGCTGGAAAC CCTGGGCAAA ATCCGCACCA TCGCCCTGGA CAAGACGGGG
ACCCTGACCC GGAACAAACC CGCCGTGATC GAGGTCGCGG CCACCGCCTC CTCCACCCGG
GAACGGGTGC TGGCCGTCGC CGCCGGCCTG GAAGAACGCA GCGAACACCC GCTCGCCCGG
GCCATCCTCG CCGCCACCAC GGACCGGGTC ACCGTCACCG ACCTGAACAC GGTTCCCGGT
GCGGGGCTCG AAGGGACCAT TGACGGGCAC AGTGCCCGGC TCGGCAGGCC CGGATGGATC
GCCCCCGGGG AGCTGAAGGA AGCCGTCCGG CGGATGCAGG CCGGCGGCGC GACAGCGGTC
CTGGTCGAGG AGCAATCTGT CCTCATCGGT GCCGTCGCCG TACGGGACGA ACTCCGCCCC
GAAGCCAGGG CCGTCATTGA ACGCCTGAAC CGCGCCGGCT ACACCACCGC CATGCTCACC
GGGGATAACC GGCTCACCGC AGAGGCCCTG GGCAAGGCAG CGGGCATCAC CGAAGTCCAC
GCCGACCTGC GCCCCGAAGA CAAGGCGGAC ATCATCCGCA CCCTGAAGGA GCGGCAGCCG
ACCGCCATGG TCGGGGACGG CGTGAACGAC GCCCCGGCCC TGGCGACCGC CGACACCGGC
ATCGCGATGG GCGCGATGGG CACCGACGTG GCCATCGAAA CCGCCGACAT CGCCCTGATG
GGCGAGGACC TGCACCACCT GCCCCAGGTA CTGGAGCACG CCCGCCGTAC CCGCCGGATC
ATGTTCCAGA ACGTGGGCCT GTCCCTGGCC CTGATCGCGG TGCTGATCCC GCTGGCCCTG
TTCGGCATCC TGGGGCTGGC AGCCGTGGTC CTGATTCATG AATTGGCCGA AATCGTTGTC
ATCGCCAACG GCGTTCGCGC CGGCAAGGTC AGCAAGTACG CCACCATCCC GGTTTCCCGG
GACGCCGTCC CCAACCTGGA ACCGGCCCGG TGA
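
The GC content field can in principle be recomputed from the gene sequence as printed here, in space-separated blocks of ten bases. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the whole sequence; the helper name `gc_content` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence above.
print(gc_content("GTGAGCGACG CGTGCGGATG"))  # 70.0
```

Passing the full sequence instead of the two-block excerpt gives the value to compare against the reported figure.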
 
Protein sequence
MSDACGCGDD KPETEEGQEE AEGFWQVTEV RAAAVSGVLL LVAWIASLLD ASEWVTLPLE 
AAALLVAAWT FVPSTLRRLV KGKIGVGTLM TIAAVGAVIL GQIEEAAMLA FLYAISEGLE
EYSVARTRRG LRALLDLVPS EARVLRNGTE VTVSPSELVL GETMVVRPGE RLATDGRVLT
GRTSLDTSAL TGESVPVEAG PGSEVYAGSI NGTGPLEVQV TSTAENNSLA RIVHIVEAEQ
SRKGPGQRLA DSIAKKLVPG ILIAAALIAV FGFIVGEPVL WIERALVVLV AASPCALAIS
VPVTVVAAVG AASRMGVLIK GGGALETLGK IRTIALDKTG TLTRNKPAVI EVAATASSTR
ERVLAVAAGL EERSEHPLAR AILAATTDRV TVTDLNTVPG AGLEGTIDGH SARLGRPGWI
APGELKEAVR RMQAGGATAV LVEEQSVLIG AVAVRDELRP EARAVIERLN RAGYTTAMLT
GDNRLTAEAL GKAAGITEVH ADLRPEDKAD IIRTLKERQP TAMVGDGVND APALATADTG
IAMGAMGTDV AIETADIALM GEDLHHLPQV LEHARRTRRI MFQNVGLSLA LIAVLIPLAL
FGILGLAAVV LIHELAEIVV IANGVRAGKV SKYATIPVSR DAVPNLEPAR
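
The gene starts with GTG while the protein starts with M, which is consistent with translation table 11, where GTG can serve as an initiator codon. Below is a minimal sketch of that translation using only the standard library. It assumes the first codon is always read as methionine and that translation stops at the first in-frame stop codon; `translate_cds` is a hypothetical helper, not a function from an existing package.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Table 11 uses the
# same amino acid assignments as the standard code; it differs in which
# codons may act as initiators.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the initiator codon as M (assumption)."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    protein = ["M"]  # GTG initiator is decoded as Met, not Val
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence above.
print(translate_cds("GTGAGCGACG CGTGCGGATG CGGCGATGAT"))  # MSDACGCGDD
```

The output matches the first ten residues of the protein sequence listed above.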