Gene Arth_3912 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3912 
Symbol 
ID4444552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4406829 
End bp4408568 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content66% 
IMG OID639691737 
Productprotein of unknown function DUF853, NPT hydrolase putative 
Protein accessionYP_833387 
Protein GI116672454 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCATCA AATCCACTGC AGATAAAGTC GCCACCATCC AGAAGGGATA CACCCTGGAC 
GGCGCCACCA TCGAACTGGG GGCCGCGATC GTCGACGGCG AGCTCCACAA GGACGCCCCC
GTCCGGCTGC CCCTGGCCAT GATGAACCGG CACGGACTGG TGGCCGGCGC AACAGGTACC
GGCAAGACCG TCACGCTCCA CATGATGGCG GAGCAGCTGT CCACGGCCGG GGTGCCGGTG
TTCCTCGCCG ACATCAAGGG AGACCTTTCC GGGCTGGCCA CCGCCGCAAC CGGCAGCGAA
AAACTGAAAG CGCGCACGGA CAGCATCGGC CAGGCCTGGG CGGGCAGAAC TTTTCCCGTG
GAATTCCTGG CTCTCGGCGG CGACGGCAAC GGCATCCCTG TCCGGGCCAC CATCACTTCT
TTCGGCCCCA TCCTGCTCTC GCGGATCATG GAGCTCAACG ACACCCAGGA ATCCAGCCTG
CAGCTCGTTT TCCACTTTGC GGACAAGAAC AACCTGGAAC TGATCGACCT CAAGGACCTC
AGGGCGGTCA TCCAGTTCCT CACGTCGGAC GAGGGCAAGG ACGAACTCGA GGCGCTGGGC
GGGCTCTCCA AGGCGACGGC CGGCGTCATC CTCCGCGAAC TGGTGACCCT TGAGGCACAG
GGCATGGAAG CATTCTTCGG CGAGCCCGAA TTCGACACCG CCGAACTGCT GCGCACCGCC
CCTGACGGCC GCGGCGTCAT CACCTGCCTG GAACTGCCCA CGCTGCAAAC CAAGCCCATG
GTGTTCTCCA CCTTCCTGAT GTGGCTGCTC GCGGACCTGT TCGAGGACCT GCCCGAAGCC
GGGGATCTGG ACAAGCCCAA ACTGGTCTTC TTCCTCGACG AGGCACACCT GCTCTTCAAC
GATGCCTCCA AGGCGTTCCT GGAGGCGATT ACCACCACTG TCCGGCTCAT CCGTTCCAAG
GGCGTGGGCA TCTTCTTTGT CACCCAGACG CCCAAGGACG TGCCGGCCGA TGTCCTGGGG
CAGCTGGCAA ACCGCATCCA GCACGCCCTG CGCGCGTTCA CCCCGGAAGA CGCCAAGGCC
CTGAAAGCCA CCGTGTCCAC GTTCCCGGTG AGCGACTACG ACCTCGAGGA AACGCTGACC
TCGGCCGGAA TCGGTGAAGC CGTCATCACG GTGATGAATG AAAAGGGCGC CCCCACCCCG
GTGGCATTGA CCCGCCTCCG CGCCCCGGAA TCCTTGATGG GCCCCAGCAC GGAGGACCTC
GTCAAGAGCA CCGTGGCCGG TTCCGCGCTG CTCGTAAAAT ACGGCACGGC CGTGGACAAG
GTCTCCGCCT ACGAGAAGAT CTCAGGAAAG GGCGCCGCCC CCACGGGAGC CGCGGCGCCC
GGGCAGCCTC CCGCTCCGAA CTCGGAAATT TTTGTGCCCA ATAGCCCCGC GCCCGGTAGC
CCTGTTCCCG GAACCATGGA CCAGGCCTCC GTGGACGCCG ATGCCCGGCG CATCGAGGAA
GATATCCTGG GCCGCCCCAG CAGCAGACCC GCCCCGGTAC CGGAACGGCC ACGCAGCGGC
GAACGCACAG CGCCGCAGGC CCGGAAGGAA TCAGGCGGAA GCATGGCTGA CGATCTCGCC
GGAGCCTTGG GCGGCGCGCT GGGCGGCGGC CTCAAAAGCA TGGCCCGCTC GCTCGGAACC
CAACTGGGCC GGGAGCTGTT GCGGGGCGTC TTCGGCACGT CCTCCCGCCG CCGCCGGTAG
 
Protein sequence
MAIKSTADKV ATIQKGYTLD GATIELGAAI VDGELHKDAP VRLPLAMMNR HGLVAGATGT 
GKTVTLHMMA EQLSTAGVPV FLADIKGDLS GLATAATGSE KLKARTDSIG QAWAGRTFPV
EFLALGGDGN GIPVRATITS FGPILLSRIM ELNDTQESSL QLVFHFADKN NLELIDLKDL
RAVIQFLTSD EGKDELEALG GLSKATAGVI LRELVTLEAQ GMEAFFGEPE FDTAELLRTA
PDGRGVITCL ELPTLQTKPM VFSTFLMWLL ADLFEDLPEA GDLDKPKLVF FLDEAHLLFN
DASKAFLEAI TTTVRLIRSK GVGIFFVTQT PKDVPADVLG QLANRIQHAL RAFTPEDAKA
LKATVSTFPV SDYDLEETLT SAGIGEAVIT VMNEKGAPTP VALTRLRAPE SLMGPSTEDL
VKSTVAGSAL LVKYGTAVDK VSAYEKISGK GAAPTGAAAP GQPPAPNSEI FVPNSPAPGS
PVPGTMDQAS VDADARRIEE DILGRPSSRP APVPERPRSG ERTAPQARKE SGGSMADDLA
GALGGALGGG LKSMARSLGT QLGRELLRGV FGTSSRRRR