Gene Arth_0148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0148 
Symbol 
ID4447411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp152412 
End bp153539 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content67% 
IMG OID639687943 
Producthypothetical protein 
Protein accessionYP_829649 
Protein GI116668716 
COG category[S] Function unknown 
COG ID[COG5282] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03624] putative hydrolase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGCTA TGGAGTCAAC TGCAGGCGAT TCATCGGTAC AGGCCCAGGC GCTGATCAAC 
TGGGAGCTCG CGGCGTCAAC CGCTGCGCGG CTGGCTCCGG CAGGCCCTTC GCTGGGTTCC
GCCGAAATCG GGACGGCCGT GGAGAACCTG CGCCTGATGG CGGACATCTC CGTGCCGCAC
GTCCATGACA TCACCGGGCT GGAAGCCGCG CGGGACCTCC GCGATTCCTC CGTGCTGGTG
GTGGACCGCG CCTCCTGGGC CAAGGCCAAC ACCCAGAGCT TCACCGTCAT GCTGAAGCCG
GCGATGGAAA AGATGCTGGA GGGCCGCCGC GGAACCATGA GCCCCGGTGC GGCGTCCGTC
AGCGGCGCCA TCACGGGTAG CCAGTTGGGC GCCGTGCTCG CCTTCCTCTC CAGCAAGGTC
CTGGGCCAGT ACGATCCTTT CTCGGCACTC GCCGAAGACT CAACGGCCCC CGCCGGCGGA
CGCCTTCTGC TGGTTGCGCC GAACATCGTC CAGGTGGAGC GCGAACTCAA CGTTGCCCCC
GAGGACTTCC GGCTGTGGGT CTGCCTGCAC GAACAGACGC ACCGCGTGCA GTTCGCGGCC
GCACCCTGGC TGCGCCACCA CATGCTCAAC GAGATCGACA ACCTTAGCGA GCACCTGCTG
GGCAACGTCG ACACCCTCCT CGAGCGCGCG TCGGCTGCGG CCAAATCACT CAAGGACCGC
ACGGCCGCCG GAACGGCTCC CGGGCGCGGC GCTATCCTGG ACCTGCTCCA GGACCCGGAA
GAAAAAGCCT CCCTGTCACA CCTGACCGCC GTGATGAGCC TGCTGGAAGG CCACGCCAAC
GTGGTGATGG ACGCGGTCGA CGCCAGCATC GTCCCGTCCG TCAAGACCAT CCGGCAGCGC
TTCAACGCCC GGGGCAAGGA CCGGGGCGTC GTGGAGAAAT TCATCCGCAG CCTGCTGGGC
CTCGATGCCA AGATGCGCCA GTACACGGAC GGCGCCAAAT TCGTCCGCGC CGTGGTGGAC
GTGGCTGGCA TGGAAGGCTT CAACCGGGTC TGGGAATCCG CTGCGAACCT GCCCACGGAA
CCGGAAATCC ATGACGCCAA GCTCTGGCTC GAGCGGATGG GGCTCTAG
 
Protein sequence
MDAMESTAGD SSVQAQALIN WELAASTAAR LAPAGPSLGS AEIGTAVENL RLMADISVPH 
VHDITGLEAA RDLRDSSVLV VDRASWAKAN TQSFTVMLKP AMEKMLEGRR GTMSPGAASV
SGAITGSQLG AVLAFLSSKV LGQYDPFSAL AEDSTAPAGG RLLLVAPNIV QVERELNVAP
EDFRLWVCLH EQTHRVQFAA APWLRHHMLN EIDNLSEHLL GNVDTLLERA SAAAKSLKDR
TAAGTAPGRG AILDLLQDPE EKASLSHLTA VMSLLEGHAN VVMDAVDASI VPSVKTIRQR
FNARGKDRGV VEKFIRSLLG LDAKMRQYTD GAKFVRAVVD VAGMEGFNRV WESAANLPTE
PEIHDAKLWL ERMGL