Gene Arth_0033 details


Gene Information

Locus tag: Arth_0033
Symbol:
ID: 4447503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 39379
End bp: 40515
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 64%
IMG OID: 639687827
Product: hypothetical protein
Protein accession: YP_829534
Protein GI: 116668601
COG category:
COG ID:
TIGRFAM ID:
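
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 39379
end_bp = 40515
gene_length_bp = 1137
protein_length_aa = 378


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # True True
```

Under these assumptions the listed gene length matches the coordinates, and the listed protein length matches a 1137 bp CDS with one stop codon.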


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCCGGGAG CCGAAGTCGA GATTGTCTGG ACTGCCCTGT CAAAGGCGTG GCGGAAGGCC 
CAATACGAAG AACTGTGGCT GCAGTTCCGG CCCCGTGGCT CGAAGGATGC CTTTGAGGCA
GTTGAGGCTG CCGTCCAGAT GGCGAACCAG GCCGTCGATG ACGCCGGCTG GGGCCATGAC
GTGTTCACAG CAGGGGTCTC GGATTCGGAC GCCGGTCCGG TGGCGCTGAT GAGCAGGGCC
GGGACCGAGG AAGGGGTGCG CGCCTGGTTC GCTGCTTTCG CCCAACATTT GGAGACCCTC
GGGAAATCCG GGAGGGTCAC CGCCGCGCCG GAGGCGTTTT TCCCGGAATG GTTGAGCAGC
GGACAGGTCC CGCAGCAGCT GACCGCGTAC GTGTCCTACC AGACCAAGGA CTTGGCCCTG
CTGGACGAAG ACGAGCAGCG CCGCGCCTGG CACGTCCCGA AGGCCCTAAC AGCACAGATC
GCCGACGCTG CTACGTCGTG GGGCCGCTTC GAGGGAGCCG ATGTCTACCG CAATATTCAC
CAGATCCGCT CCAAGAACCC TGACGTTGGG GCCGCGATGG CGGCCGGGGT CGAGAAATTC
GGCATGGCCG GCGTGACCTA CCTGCGGTCC GAGCCGCGCC GCTTCACGTG GGCTTCGATG
AGCCCCCTGG GACGGACCTG TTACGGGGTT ATGGACGACA CGATGTCCTG GCAGGAGCGC
TTGGCTCAGG TCACCCGGGC CATGACCGCG TTCCCAAGCG ACACCGATCT GGCATTCGTA
CGGCACAGCA AAGCGCTCAC CATCTCCTGG GGTGACCTCG CCGGTGCCAG GCCGGCTCTT
CCCCACGTCA AGGAGTATCA CGTCCGCTAC AACAGGCACC TAAACCGGCA GTACATCCCC
GACGCCCACG GAATCCAGCT CCTCACCGAC GCCCACTTGG AACAAGCCAA CGACCTCTCC
GACTGGAACA TCACCGGCCT TGGTGGGGGC AAGCATCTGG TGGAGGCCAA AGATCTGCAG
CTCTGGTATG CCAATATCGA CCCCGAGCCG GAAACACTGG CACAGGCACG CGCTGATTTC
TCCGGGATGA TCCTCACCCC GGAGATCATT GCAAAGAATC CTCCTCCTTG GCGCTAA
 
Protein sequence
MPGAEVEIVW TALSKAWRKA QYEELWLQFR PRGSKDAFEA VEAAVQMANQ AVDDAGWGHD 
VFTAGVSDSD AGPVALMSRA GTEEGVRAWF AAFAQHLETL GKSGRVTAAP EAFFPEWLSS
GQVPQQLTAY VSYQTKDLAL LDEDEQRRAW HVPKALTAQI ADAATSWGRF EGADVYRNIH
QIRSKNPDVG AAMAAGVEKF GMAGVTYLRS EPRRFTWASM SPLGRTCYGV MDDTMSWQER
LAQVTRAMTA FPSDTDLAFV RHSKALTISW GDLAGARPAL PHVKEYHVRY NRHLNRQYIP
DAHGIQLLTD AHLEQANDLS DWNITGLGGG KHLVEAKDLQ LWYANIDPEP ETLAQARADF
SGMILTPEII AKNPPPWR
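
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a simplified example rather than a reference implementation: it builds the codon table from the standard NCBI ordering, whose elongation codon assignments are the same in translation table 11, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that case does not arise here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (first base varies slowest, order TCAG).
# Table 11 assigns the same amino acids to codons during elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from this record.
first_line = "ATGCCGGGAG CCGAAGTCGA GATTGTCTGG ACTGCCCTGT CAAAGGCGTG GCGGAAGGCC"
print(translate(first_line))  # MPGAEVEIVWTALSKAWRKA
```

The printed residues correspond to the first two 10-residue blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.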