Gene Arth_3658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3658 
Symbol 
ID4443659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4110102 
End bp4111151 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content61% 
IMG OID639691482 
Productglycosyl transferase family protein 
Protein accessionYP_833133 
Protein GI116672200 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATTACAG TTATTATCCC TGCGCATAAT GAAGCTGCAG GTATTTCTGA CACGCTCGAA 
TCCCTCAAAT CCCAGACTCA GCCGCCGGAC AGGGTGGTGG TGGTTGCCGA CAAATGCACC
GATGCCACCG GGGAAATCGC ACTGGCGCTC GGCGCAGAGG TCATCCGCAC GGTGGGAAAC
ACAGATAAAA AAGCCGGCGC CTTGAATTTT GCGCTGGAGG GCCTACTGCC GGGCGCGAAT
CCGGAAGACA TGATCCTCGT CCAGGACGCG GATTCGCAGT TGAGCCATGA CTTCATCGAG
CGGGCAACCG CTCACCTGCG TGCCGACAGG CGGCTTGGCG CCGTGGGCGG CGTCTTCCGC
GGCGCCGACG GCGGCGGATT CGTGGGTCAC CTTCAGCGTA ATGAGTACGC ACGCTACGCC
CGGGACGTGA AGCGGCTTCA CGGCAAGTGC CTTGTGGTGA CCGGAACGGC CGCGCTCTTC
CGCGTCCGGA CCTTGGAGGA TGTCATCGAA GCCCGGCTTG ACGGCACGCT GCCGCCGGGT
AACTGCAGGG GAGGCGTTTA CGACACCTCC GTCCTGACCG AGGACAACGA GTTGTCCTTC
GCGCTGCTGA CCCTCAACTA CCGCATCAAA TCGCCGGCCG ACTGCACGCT CGTCACCGAA
ATCATGCCGA CCTGGCGTGA GCTCTGGGCA CAGCGGCTGA GATGGAAGCG CGGAGCCGTG
GAGAACTGTG TCCAGTACGG CTGGACCAGG GTGACCCGGC CGTACTGGGG GAGGCAGGCG
CTCTCCGTGA CAGGCATTGT GGTGTCGTTG GCCTACTTCG GAACGGTGGC TTTTGCACTG
GGCACGGGAG AAGGGCTGCA CATTCAGCCC TTCTGGATGG CCGTGACCGG TGTCTTCGTG
ATCGAACGGG TAGTGACTGT GCGGCTGCGT GGCTGGAAGT ACATGCTCGC CGCCGCAACG
ATGTACGAAC TTCTGATCGA CCTGTTCCTT CAGGTAGTCC ACGCGAAGGC TTACGTGGAT
GTAGCACTCA ACAAAAAGAA AGCTTGGTAA
 
Protein sequence
MITVIIPAHN EAAGISDTLE SLKSQTQPPD RVVVVADKCT DATGEIALAL GAEVIRTVGN 
TDKKAGALNF ALEGLLPGAN PEDMILVQDA DSQLSHDFIE RATAHLRADR RLGAVGGVFR
GADGGGFVGH LQRNEYARYA RDVKRLHGKC LVVTGTAALF RVRTLEDVIE ARLDGTLPPG
NCRGGVYDTS VLTEDNELSF ALLTLNYRIK SPADCTLVTE IMPTWRELWA QRLRWKRGAV
ENCVQYGWTR VTRPYWGRQA LSVTGIVVSL AYFGTVAFAL GTGEGLHIQP FWMAVTGVFV
IERVVTVRLR GWKYMLAAAT MYELLIDLFL QVVHAKAYVD VALNKKKAW