Gene Arth_4157 details

Gene Information

Locus tag: Arth_4157
Symbol:
ID: 4447572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4679022
End bp: 4680512
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 66%
IMG OID: 639691988
Product: metal dependent phosphohydrolase
Protein accession: YP_833632
Protein GI: 116672699
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR02692] tRNA adenylyltransferase
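
The start, end, gene length, and protein length fields in this record agree with one another, which a short sketch can confirm. It assumes that the coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4679022, 4680512)
print(length)                  # 1491
print(protein_length(length))  # 496
```

Both results match the Gene Length (1491 bp) and Protein Length (496 aa) fields listed in the record.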


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCACG CACATCAAAA GTCCGACCCG CACACCGTCG ATTTCCAGGT GGACCCGGTG 
GTCCTGGAGC TCGGGCAGCG CTTCGTCGAC GCCGGGTATG AACTGTCGCT GGTGGGCGGG
CCCGTACGTG ACCTGTTCCT GGGCAGGCGC TCCCCTGATC TCGACTTCAC CACGGACGCC
ACGCCGGACC AGACTCTGGC ACTGATACGG AAGTGGGCGG ACAACTTCTG GGAGATCGGC
AAGGCGTTCG GCACCATCGG AATGCGGAAG TCCGGTTTCC AGATTGAAAT CACCACCTAT
CGCGCCGAGG CGTACGATCC GGAGTCGCGG AAGCCCATGG TGGCGTTTGG GTCCTCGCTG
ACCGATGACC TGCTTCGCCG GGACTTCACG ATCAATGCCA TGGCGCTGCG GCTGCCCTCA
ATGGAACTGA TCGACCCCTT CGGCGGTGTG CGCGACCTCC ACGCCTCCGT GCTGGCCACG
CCCGGTGCAC CGGAGACGTC CTTTTCCGAT GACCCCCTGC GCATGATGCG GGCGGCCCGC
TTTGCCGCCC AGCTCGGCGT GGCGGTCCGC GGGGACGTCA AGCACGCAAT GTCGCAGATG
GCGGAACGGA TCAGCATCAT TTCCGCGGAG CGTGTCCGCG AGGAACTGGT GAAGCTCATC
TGCGGAGCCC ACCCCCGCGT GGGCATCGAC CTCCTCGTGG ACACCGGCCT GGCCGAATTC
GTCCTGCCGG AAGTGTCCGC CCTGCGCCTG GAGGCGGACG AACACCACCG GCACAAGGAT
GTCTACCAGC ACTCCCTCCA GGTGCTGGAG CAGGCCGCTT CCTTGGAGAC AGATTCCGAC
GGCGCAGTGC CCGGCCCCGA TTTTGTGCTG CGGTTTGCGG CCCTCATGCA CGACGTCGGC
AAGCCGGCTA CGCGCCGTTT TGAACCGGGC GGCGCGGTGA GCTTCCGCCA CCACGACATG
GTGGGTTCCA AACTGACCAA GAAACGGATG AAGGCCCTCC GCTTCGACAA CGACACCATC
AAGGCAGTGG CACGGCTCGT CGAGCTGCAC ATGCGCTTCT ACGGTTACGG GGAGGCCGGC
TGGAGCGACT CCGCGGTCCG CCGGTACGTT ACGGACGCAG GGCCGCTCCT GGAGCGCCTG
CACCGGCTGA CCCGGTCAGA CGTGACCACC CGGAACCAGC GCAAGGCGGA CCGGCTGTCC
TTTGCCTATG ACGACCTTGA GGCGCGCATC GCCGCACTGC GTGAGCAGGA ATCGCTGGAA
GCGGTACGGC CGGACCTGGA CGGCGGCCAG ATCATGGCCC TGCTGGGGCT GAAGCCGGGG
CCCGTGGTGG GCAAGGCATA CAAGTTCCTG CTTGAACAGC GGATGGAGCA CGGCCCGCTC
GATCCTGTTG TGGCAGAGGC CAGGTTGCGC GAGTGGTGGG CGGACCAGCC CGAGGCCGCC
GAACACCAGC CGGCCGCCGG CGTCGAACTT TCCACTACCG AGGAGTCATA A
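
The gene sequence is laid out in groups of 10 bases, 60 bases per line. The sketch below shows one way to strip that layout and compute a GC percentage. It is illustrative only: the helper names are hypothetical, and the example runs on the first line of the sequence, not the full gene, so its result is not the gene-wide GC content reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, in its original 10-base grouped layout.
first_line = "ATGGCGCACG CACATCAAAA GTCCGACCCG CACACCGTCG ATTTCCAGGT GGACCCGGTG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))
```

Passing all lines of the gene sequence through clean_sequence should give a 1491-base string, in line with the Gene Length field.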
 
Protein sequence
MAHAHQKSDP HTVDFQVDPV VLELGQRFVD AGYELSLVGG PVRDLFLGRR SPDLDFTTDA 
TPDQTLALIR KWADNFWEIG KAFGTIGMRK SGFQIEITTY RAEAYDPESR KPMVAFGSSL
TDDLLRRDFT INAMALRLPS MELIDPFGGV RDLHASVLAT PGAPETSFSD DPLRMMRAAR
FAAQLGVAVR GDVKHAMSQM AERISIISAE RVREELVKLI CGAHPRVGID LLVDTGLAEF
VLPEVSALRL EADEHHRHKD VYQHSLQVLE QAASLETDSD GAVPGPDFVL RFAALMHDVG
KPATRRFEPG GAVSFRHHDM VGSKLTKKRM KALRFDNDTI KAVARLVELH MRFYGYGEAG
WSDSAVRRYV TDAGPLLERL HRLTRSDVTT RNQRKADRLS FAYDDLEARI AALREQESLE
AVRPDLDGGQ IMALLGLKPG PVVGKAYKFL LEQRMEHGPL DPVVAEARLR EWWADQPEAA
EHQPAAGVEL STTEES
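
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The sketch below is minimal and makes two assumptions: it uses the standard amino-acid assignments, which translation table 11 shares with table 1, and it ignores the special handling of alternative start codons, which does not matter here because the gene starts with ATG. The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Amino acids in NCBI order, with codon bases iterated as T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases and last 21 bases of the gene sequence.
print(translate("ATGGCGCACG CACATCAAAA GTCC"))   # MAHAHQKS
print(translate("TCCACTACCG AGGAGTCATA A"))      # STTEES
```

The two outputs match the first eight and last six residues of the protein sequence listed above.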