Gene Arth_4031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4031 
Symbol 
ID4447867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4550450 
End bp4551877 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content66% 
IMG OID639691862 
Producthypothetical protein 
Protein accessionYP_833506 
Protein GI116672573 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAGG CCACGCCCTC AGGAAGCGGC GGACAGCTAC CACCGCTGGA GCCGGCAACG 
GACGGCAGCA TGCGGATCCG CGGCGGAGTC GGAGGAATCA GCTTTCAACT GGAAGAGCTC
ATGGCCGGTG TGGCTGAACT TGACGGCATG GCTGACGAAC TCGCCGCGGT TGAGATCGGG
GTCCGGCGGA TCTGGGAAGA ACTGTGTCCC TACGAGAACG ATCCCCGAAC AAGCGGCACG
GCCGCACTGA TTGCGGTGGG TGAAGGGGGC CAGGCGGTCC GCGCCGTCAG GGAAGAACTC
CAACACCTCA GCAGCCAGGT CCGGGCGAGC CGCCGGGACT ACGAAACGGC AGAGTCATTG
GCCGCCGCGG GGGTCCGCAT GCCGGACGAC GGCTGGGGCT TCCTGCCTGC GCTCTATCTG
GACCTGAAGA CCACGTTTGT GCCGAGCCGC GACGCTGCCG AGGCGCTCGC GGCGCCTTTG
TCACTCGCAC TGATGATAGG AATTTCCCCT GCGGAGCTGG CTCGCGCGTT GGCAGCCGAG
GTTGCAGCGG GCCGGGGTTT CCTGGCCATT GGCCCGTTGA TGCGGAGGCT GGCGGAGGGC
ACGCTTCCCT TCCTCAAGCC GCGTCCCGTC ACGGCCGTCG AGGAACTCAC CCGGGATGTG
GTCGTGGACA CCTCGCCCGC CGGCCTCCTC GCCAGGCTTC GCGAACTTGA CGCTGAGGGC
CACGGAAAGA TCGAGGTGGT GCAAGTGGAG GCGGACGGCC GAAAAGCGTA TATCGTCATC
ATCCCGGGAA CCCAGCCCGG GGATCCGGCA GGGGGTTCGA ACCCGCTGGA TGAGGCCGGG
ATTGCTGAGG CGCTGGGCTA TGGCTCGGAA TACCTGAATG CTGCCGTGCT GTCGGCATTG
CACCAGGCCG GTGCAGTCAA AGGGGATCAG GTAGTGGCTG TGGGCTACAG CCAGGGCGGG
GCACATGCCA TGAACCTCAG CAGTGACAAG GCGTTCCTCG CCGAATTCGA CCTGAAGTAT
GTGCTGACGG CCGGTTCACC GGTGGGCGCG ATTTCGCCGG CACCGGGAAT CACGTCCCTC
CACCTCGAAC ATCGCCAGGA TTGGGTTCCC GGTAGTGACG GAACTCCTAA CCCGGACACC
AGGGAAAGGG TCACCGTCAC GCTGACCGAC AGGGTGTTCA GGCCGCCGGG TTTTGATCTC
AACCTGGGGC CGGGCCACAA CATTGGCAAC TACGAGGAGG GCGCCAAGGC AGTGTCGGCC
AGCAAAGACC CGTCCCTGGT CGCGAACACG GCGGTCCTCG CCGGCGTTGT TGGCGCGGGA
GGGGCAGGGA CCGCCACCCG CTTTGCCGTA AACCGGGAAC CGAAGGCCCC GACGGCGCGC
CAGCAGGACA GGCCGCTCCA AGGGGCAGCC AGGTGGGTCG GCCGCTAG
 
Protein sequence
MAEATPSGSG GQLPPLEPAT DGSMRIRGGV GGISFQLEEL MAGVAELDGM ADELAAVEIG 
VRRIWEELCP YENDPRTSGT AALIAVGEGG QAVRAVREEL QHLSSQVRAS RRDYETAESL
AAAGVRMPDD GWGFLPALYL DLKTTFVPSR DAAEALAAPL SLALMIGISP AELARALAAE
VAAGRGFLAI GPLMRRLAEG TLPFLKPRPV TAVEELTRDV VVDTSPAGLL ARLRELDAEG
HGKIEVVQVE ADGRKAYIVI IPGTQPGDPA GGSNPLDEAG IAEALGYGSE YLNAAVLSAL
HQAGAVKGDQ VVAVGYSQGG AHAMNLSSDK AFLAEFDLKY VLTAGSPVGA ISPAPGITSL
HLEHRQDWVP GSDGTPNPDT RERVTVTLTD RVFRPPGFDL NLGPGHNIGN YEEGAKAVSA
SKDPSLVANT AVLAGVVGAG GAGTATRFAV NREPKAPTAR QQDRPLQGAA RWVGR