Gene Arth_4430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4430 
Symbol 
ID4443304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008537 
Strand
Start bp50140 
End bp52095 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content70% 
IMG OID639687484 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_829181 
Protein GI116662126 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2217] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01512] heavy metal-(Cd/Co/Hg/Pb/Zn)-translocating P-type ATPase
[TIGR01525] heavy metal translocating P-type ATPase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGACG CCTGCGGCTG CAGCGATGAG AAGACCGGGA CCGGCGAATC CGAAGAGGCA 
GAGGAAGCCG TTGGGTTCTG GCAGGTCCGC GAGGTCCGGG CGGCCGCCGT TTCCGGTGTC
CTGCTCCTCG CGGCGTGGAT CGCATCGCTG GCCGCGGGGC CCCAGTGGCT GAGGCTTCCC
CTGGAGATCA TTGCGCTGCT GGTTGCAGCC TGGACGTTTG TTCCTTCGAC CGTGCGGCGC
CTTGCTAAGG GCAAGATCGG CGTGGGGACG CTGATGACGA TCGCCGCCGT CGGTGCGGTC
GCCCTCGGGC AGTTCGAGGA GGCCGCCATG CTGGCCTTCC TTTACGCCAT CTCCGAAGGC
CTCGAGGAGT ACTCGATGGC GAAAACCCGC CGCGGCCTGC GAGCCCTGCT GGACCTGGTC
CCGGCCGAGG CGGCCGTCCT GCGCGGCGGC ACGGAAGTCA AAGTCCCGCC GGCAGAACTC
GTCCCCGGGG ACCGGATGGT TGTCCGGCCC GGGGAGCGTC TGGCCACTGA CGGCAGGATC
ATTACCGGCC GCACGTCCCT GGACACCTCG GCGCTCACCG GCGAATCCGT GCCCGTGGAG
GCCGGGCCGG GAAGCGAGGT CTTTGCCGGG TCGATTAACG GCACCGGCCC GCTCGAGGTC
GAAGTCACCA GCACCGCGGA GAACAACTCG CTGGCCAGAA TCGTGCACAT CGTGGAGGCC
GAGCAGTCCC GCAAGGGCCC GGGCCAGCGC TTGGCCGACT CCATCGCCAG CAAGCTCGTC
CCGGGCATCC TGGTCGTCGC CGCGTTGATC ATCGTCTTTG GCTTCATCGT CGGGGAACCG
CTGCTGTGGT TTGAACGCGC ACTCGTCGTC CTGGTCGCCG CCTCACCCTG CGCCCTGGCC
ATCTCCGTCC CCGTCACGGT CGTCGCCTCC GTCGGTGCGG CCAGCCGCAT CGGCGTGCTC
ATCAAGGGCG GAGGCGCCCT GGAAACCCTT GGCAAAATCC GCACCATCGC GCTGGATAAG
ACCGGAACGC TCACCCGCAA CAAGCCCGCC GTGATCGACG TCGCTGCCAC CGGAACCGCG
ACCAGCGAAC GCGTACTTGC CGTCGCTGCC GGGCTGGAGG CCCGCAGCGA ACACCCGCTC
GCCCGCGCCA TCCTCGCCGC CGCACCGGAT CGGGCGGCTG TCACCGATGT GGACACCGTC
CCCGGCGCCG GTTTGGAAGG CCGGCTCGAG GGCAGGACCG TCCGCCTCGG CCGCCCGGGC
TGGATCAACG CCGGTCCCCT GACCGCCGAG GTCGAGCGGA TGCAGCACGC CGGCGCCACC
GCCGTCCTCA TCGAGGACGA CGGCCAGGTG ATCGGCGCGA TCGCGGTCCG TGACGAGCTG
CGCCCCGAAG CCCGCGACGT CATCGCCCGG CTCACGGCGT CCGGCTACAC CACCGCCATG
CTCACCGGCG ACAACCTCAT CACCGCCACC GCGCTGGGTA AGGCCGCGGG CATCACCGAG
GTCCACGCCG ATCTCCGTCC CGAGGACAAG GCGGAGATCA TCCGCACGCT CAAGGCCCGC
CAGCCCACCG CCATGGTCGG GGACGGCGTC AACGACGCCC CCGCCCTGGC GACGGCCGAC
ACCGGCATCG CGATGGGTGC CATGGGCACC GATGTCGCGA TCGAAACCGC CGACATCGCC
CTGATGGGCG AGGACCTGAA CCACCTGCCC CAGGTCCTGG ACCACGCGCG CAGGACCCGG
GCCATCATGC TCCAGAACGT GGGGCTGTCC CTGCTGTTGA TCGCGGTGCT GATCCCGCTG
GCCCTGTTCG GCATCCTCGG CCTGGCCGCC GTGGTCCTCA TCCACGAACT GGCCGAAATC
GTGGTCATCG CCAACGGCGT CCGGGCCGGC CGGATCAGCC GCAAGACCGC GTTCACGTCC
GCCCAGCCCT CCCCTGCACT GGAACCGTCA GTATGA
 
Protein sequence
MSDACGCSDE KTGTGESEEA EEAVGFWQVR EVRAAAVSGV LLLAAWIASL AAGPQWLRLP 
LEIIALLVAA WTFVPSTVRR LAKGKIGVGT LMTIAAVGAV ALGQFEEAAM LAFLYAISEG
LEEYSMAKTR RGLRALLDLV PAEAAVLRGG TEVKVPPAEL VPGDRMVVRP GERLATDGRI
ITGRTSLDTS ALTGESVPVE AGPGSEVFAG SINGTGPLEV EVTSTAENNS LARIVHIVEA
EQSRKGPGQR LADSIASKLV PGILVVAALI IVFGFIVGEP LLWFERALVV LVAASPCALA
ISVPVTVVAS VGAASRIGVL IKGGGALETL GKIRTIALDK TGTLTRNKPA VIDVAATGTA
TSERVLAVAA GLEARSEHPL ARAILAAAPD RAAVTDVDTV PGAGLEGRLE GRTVRLGRPG
WINAGPLTAE VERMQHAGAT AVLIEDDGQV IGAIAVRDEL RPEARDVIAR LTASGYTTAM
LTGDNLITAT ALGKAAGITE VHADLRPEDK AEIIRTLKAR QPTAMVGDGV NDAPALATAD
TGIAMGAMGT DVAIETADIA LMGEDLNHLP QVLDHARRTR AIMLQNVGLS LLLIAVLIPL
ALFGILGLAA VVLIHELAEI VVIANGVRAG RISRKTAFTS AQPSPALEPS V