Gene Arth_1070 details

Gene Information

Locus tag: Arth_1070
Symbol:
ID: 4446441
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1156077
End bp: 1157381
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 68%
IMG OID: 639688876
Product: glycosyl transferase, group 1
Protein accession: YP_830564
Protein GI: 116669631
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
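
The start, end, and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes 1-based inclusive coordinates (an assumption about the database's convention, although it matches the listed values) and an unspliced CDS ending in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(1156077, 1157381)
print(length, protein_length_aa(length))  # 1305 434
```

With the values from this record, the sketch reproduces the listed gene length of 1305 bp and protein length of 434 aa.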


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTTCAAC GCTGCCCGGC TGTTAACCCA ACCTTCCTTT TCCGGTCATG CCCGCTCCCC 
CTCGGCGCAA CGCGCCGCCC GCACCGTGGA GGGGTGAGGA TCGCAATCGT TGCTGAATCA
TTCCTGCCAC TGATGAACGG GGTTACGCAC TCCATCCTTC GGGTGCTTGA GCATCTGCAG
GAGCGGGGCG ATGGCGTCAT GGTGATCGCC CCGTCGACAC AGGACACGGA GGTCCTGGAC
GTGGTGCACG GCGCGTTCGT GCACCGGCTT CCGTCGGTGC CGCTGGCCGG CTACTCGAAC
GTGCGGGTGG CGTTGGGCGG TGTGAACCGG GTCAAGAGAA TCCTTGCCGA TTACGCGCCC
GACGTTGTCC ACCTCGCGTC CCCGTTCGTG CTCGGCTGGC GGGCGGTGCA GGCCGCTCAC
CAGCTGGGGA TTCCCACAGT GGCCATCTAC CAGACCGAGG TCCCCAGCTA CGCGGCGCGC
TACGGTGTGC CGTTCATGGA GAACTGGGCC TGGAACCGGG TGGAGAACAT CCACCTGCTG
GCGTCCCGGA CGCTGGTGCC ATCGACTTTC GCGCTGAACC AGTTGCGCGG CCGCGGAGTT
CTGCGGGTGG ACATGTGGCG GCGCGGTGTG GATACCGCGC GGTTTGCGCC GGAAAAGCGC
GACGACGGGT GGCGGGCCTC CGTGGCCCCC GGCGGCGAGC GGATCATCGG CTATGTGGGC
CGTCTGGCCG TTGAAAAGCA GGTGGAGGAC CTGGCCGTGC TGGCCGATGT GCCGGGCACG
CGGCTGGTGA TCGTGGGCGA CGGACCGCAG CGCGAGGCGC TGCAGGAAGC CCTGCCGAAC
GCCGTGTTTG CCGGGTTCCT GGGCGGTGAG CAGCTGGCCA GCGCGGTGGC GTCCTTCGAC
CTGTTCGTGC ATCCGGGCGA GTTCGAGACC TTCTGCCAGA CCATCCAGGA GGCCATGGCA
TCGGGCGTGC CGGTGGTGGC CACGGGACGC GGTGGCCCGT TGGACCTGGT GGAAAATTCC
CGCACTGGCT GGCTGTACAG GCCGGGCGAC CTCGCCGGGA TGCGGGCACA TGTCATGGAC
CTGATGGGCG ACGACGCCAA GCGCCGCGCG TTCGCTGCGG CAGCGCACGC TTCGGTCCAG
GGGCGGACAT GGCCGGCGTT GAGCGCGGAG CTGGTGCGCC ATTACCGGGC TGTCATCGCC
GGTGAACCGG TGGTTGAGCC TGTCGGGCGA ATGCCGGTGG TTGAGCCCGT CCGACGGGTA
CCGGCGGTTG GGCCTGCCGA AACCAAAAGA GGAGCAACGC TGTGA
 
Protein sequence
MLQRCPAVNP TFLFRSCPLP LGATRRPHRG GVRIAIVAES FLPLMNGVTH SILRVLEHLQ 
ERGDGVMVIA PSTQDTEVLD VVHGAFVHRL PSVPLAGYSN VRVALGGVNR VKRILADYAP
DVVHLASPFV LGWRAVQAAH QLGIPTVAIY QTEVPSYAAR YGVPFMENWA WNRVENIHLL
ASRTLVPSTF ALNQLRGRGV LRVDMWRRGV DTARFAPEKR DDGWRASVAP GGERIIGYVG
RLAVEKQVED LAVLADVPGT RLVIVGDGPQ REALQEALPN AVFAGFLGGE QLASAVASFD
LFVHPGEFET FCQTIQEAMA SGVPVVATGR GGPLDLVENS RTGWLYRPGD LAGMRAHVMD
LMGDDAKRRA FAAAAHASVQ GRTWPALSAE LVRHYRAVIA GEPVVEPVGR MPVVEPVRRV
PAVGPAETKR GATL
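
Both sequences are displayed in space-separated blocks of ten characters, so the grouping has to be removed before they can be used. The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below is a minimal Python illustration, assuming the sequence text has been copied from this record as a plain string. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "GTGCTTCAAC GCTGCCCGGC TGTTAACCCA ACCTTCCTTT TCCGGTCATG CCCGCTCCCC"
dna = clean_sequence(first_line)
print(len(dna), dna[:3])  # 60 GTG
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field listed in the record.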