Gene Arth_4122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4122 
Symbol 
ID4447642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4638558 
End bp4639739 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content67% 
IMG OID639691953 
Productglycosyl transferase, group 1 
Protein accessionYP_833597 
Protein GI116672664 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCAAA CCGGCAGTCT CGACTGCAAG CCAGGGCAAC AGGTTTTCGT GTATGTCCCC 
GGGAGCCGCT GGGAGAACGT TCAGGGAACC GACCACCGGC TGGCGGCAGC TCTGGCGAGC
CAGGTAGCGG TGCTGTGGGT CGATCCGCCG CTCCCGGTTC ACCTGGCGTT CCGGCATGGG
ATAAACGCAC TGCGCGTCCG CAACGAACTG AGCAACGTCG CACCCGGCAT CACCCGCTTG
CGTTCACTTT CCATCCCCGG CTTCACCCGC GCGGTCCTTG CCACTGTTGC CAAGGGCGTA
TTGGGGCATG CCATCAGGTC CGCTCTCCGG ACCATGAAGG CTACCCCCGT CGCCGTGATG
GTCTCATCGC CCACCTCCGG TTTCCCCACC CGGCTGGCGG GCCGGAAGAT CCTTTTTGTC
ACCGATGACT GGGTGGCCGG AGCGCCATTG ATGGGCCTGT CCGGTCCGTT GGTGCGCCGT
ACGCTGCGCC GGAACCTCCG CGAGGCGAAC ATCGCCGCGG CCGTGTCCCC GCATCTGGCC
GAAAACCTGG AGGCGAGCTT CCCTGACCGC CCGGCCTCCG TCGTCGTCCT GCCTAACGGC
TGTGATCCCG GGAAGGACGC CCCGCTTCGC GTCGAACGCT CCGACAACGC CGCCCTCGTG
GGCCAGCTGA ACGAAAGGCT GGATATGGAT CTGTTGGAGG CGGTCACGGA TGCGGGGGTC
CCGTTGCTGG TCATCGGTCC CCGGACCGAA CGCGACCCGG AGACCGGCCG GCGCCTCGAC
CTCTTCCTGG CTTCCGAGAA CGTCACCTGG CTCGGTGAGC TTCCGACCAC GGAACTGGGG
CAGCACCTGG CGGCAGCGGG CGTGGGCCTG ACTCCGTACG CCGACACCCC CTTCAACCGG
GCGAGCTTCC CCCTAAAGAC TTTGGAGTAC CTTGCCGCCG GCGTACCGGT TGTCGCCACC
GACCTGCCGG CCGTCCGGTG GTTGAACACG GAACTGGTGA CCGTCGGCAG CGGCCGTGAC
GAGTTCGCAA AGCGCGTTCA GCAGGCACTG GCCGGCCCGC ACGATCCCCT GGCGGAGGAA
CAGCGCCGTC ACTTTGCGGC ACTCCACACC TGGGAGGCAA GGGCCAACCA GCTCCTTGAC
ATGGTGGGCC CGCATGGCCA GGCAGGAGGC ATGGCCGCGT AG
 
Protein sequence
MLQTGSLDCK PGQQVFVYVP GSRWENVQGT DHRLAAALAS QVAVLWVDPP LPVHLAFRHG 
INALRVRNEL SNVAPGITRL RSLSIPGFTR AVLATVAKGV LGHAIRSALR TMKATPVAVM
VSSPTSGFPT RLAGRKILFV TDDWVAGAPL MGLSGPLVRR TLRRNLREAN IAAAVSPHLA
ENLEASFPDR PASVVVLPNG CDPGKDAPLR VERSDNAALV GQLNERLDMD LLEAVTDAGV
PLLVIGPRTE RDPETGRRLD LFLASENVTW LGELPTTELG QHLAAAGVGL TPYADTPFNR
ASFPLKTLEY LAAGVPVVAT DLPAVRWLNT ELVTVGSGRD EFAKRVQQAL AGPHDPLAEE
QRRHFAALHT WEARANQLLD MVGPHGQAGG MAA