Gene Arth_4054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4054 
Symbol 
ID4447785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4575990 
End bp4577306 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content67% 
IMG OID639691885 
Productglycosyl transferase, group 1 
Protein accessionYP_833529 
Protein GI116672596 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTCC TTGTCTACCC ACACGACCTC GGCATCGGGG GAAGCCAGAT CAACGCCATC 
GAGCTCGCCG GTGCCGTTCA CCGGCTGGGC CACGAGACCA TCGTGTTCGG GCGGCCAGGT
CCGCTCGTTG AAAAAGTCCG TGAGTTGGGG CTGGAATTCG TGGCTGCCCC GGAGATGGGC
AGGCGGCCTT CCGTGGCCGT GACCCGGGCA TTGGCCGGGC TTGTTGAGAG CCGGTGTATC
GACATCCTGC ACGGTTACGA GTGGCCGCCT TCGTTGGAAT GCTATCTGGC GGCACGCAGA
CTGACCCGGG TGGCTGCGGT TTCCACGGTC ATGTCCATGG CCGTCGCGCC GTTCATCCCC
AAACATGTTC CGCTGACGGT CGGAACCCAT CAAATCGCGG AAGCCGAAGC CGGCATCGGC
CGCTCGGCCG TCACCGTGCT GGAGCCCCCC GTCGACGTGG ATGCGAACCG TCCCGGCCTT
GACCTTGCGC AGGGAGAGCT CCGGAGAAGG TGGGGTATCG CAGACGCCGG GCATGTTGTG
GCGGTCGTGT CCCGCCTCGC CCGGGAGCTC AAGCTGGAGG GCATCCTCAG CGCCATGGAG
GCGGTGGCAT CCCTCCCGGC CGGAATGCGG GTTTGCCTGC TCATCGCCGG TGATGGCCCG
GAGTGTGCGG AAGTTACCGA GAGGGCGGCA CAGATCAATC TCCGCACCGG CCGGCAGACG
GTTGTCCTGG CCGGTGAACT TGCGGACCCC CGGGCGGCTT ACGATGTGGC CGACGTTTGC
CTTGGCATGG GCGGATCGGC GCTGCGCGCG CTGGCCTTCG GCAAACCGCT CGTAGTCCAG
GGTGAAGAAG GTTTCTGGGA GCTCCTGACG CCTTCCTCAC TTGAGACCTT CCTGTGGCAG
GGATGGTACG GCGTGGGCAG CGGGCAGGCC GGCGGCGCGT CCACCCTCCG CCAGATACTG
TTTGAGATCC TGCCGGCCGA GGGTCTTCGC GCTGAACTCG GGGACTTCGG CAGGCGCGTT
GTGGTGCACC GCTATTCGTT AGGACACGCT GCGGAGGCCC AGCTGGCCAC CTACGCGGCC
GCCCTGGATG CGGTGTCATC CGGCCGACGA GCGACCTTCC GCGAACTCGA GGCTGCCGGC
CACTTCCTCC GCTACAAAAG CCGCCGGCTG CAGGCGCGCC TGACCGGCCG GGGCTCTGCT
GACGATTTCA ACGCCAGCCC CGTCGCAGCC GCACAACCGG TGCAGGCATT CTCTGCCGGA
GTTTCCGGCC CGGTTGCCAG GATGGCCGGC ACCAGTCCTT GGCGGAGCCG GCCATGA
 
Protein sequence
MRVLVYPHDL GIGGSQINAI ELAGAVHRLG HETIVFGRPG PLVEKVRELG LEFVAAPEMG 
RRPSVAVTRA LAGLVESRCI DILHGYEWPP SLECYLAARR LTRVAAVSTV MSMAVAPFIP
KHVPLTVGTH QIAEAEAGIG RSAVTVLEPP VDVDANRPGL DLAQGELRRR WGIADAGHVV
AVVSRLAREL KLEGILSAME AVASLPAGMR VCLLIAGDGP ECAEVTERAA QINLRTGRQT
VVLAGELADP RAAYDVADVC LGMGGSALRA LAFGKPLVVQ GEEGFWELLT PSSLETFLWQ
GWYGVGSGQA GGASTLRQIL FEILPAEGLR AELGDFGRRV VVHRYSLGHA AEAQLATYAA
ALDAVSSGRR ATFRELEAAG HFLRYKSRRL QARLTGRGSA DDFNASPVAA AQPVQAFSAG
VSGPVARMAG TSPWRSRP