Gene Arth_3204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3204 
Symbol 
ID4444194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3610047 
End bp3611273 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content51% 
IMG OID639691028 
Productglycosyl transferase, group 1 
Protein accessionYP_832680 
Protein GI116671747 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGTCG AAAGTTTGAA TCTGATTGAA AATCGAACTC GGTACTCTAA ACGTACTGAT 
CAAACGGGAG GACCTGGAAC CAGCATGAAG TTTGTCATCG CATCCCACAG TGAGAGGCTC
GCCGGTGCCG AACGTTCTCT TTTCGCCATT GTAGCCGCAG CCCTTGAGGC TGGACACGAG
GTCGTCGTCA CTGTCCCACA GCAGGGTGCT CTGCAATCAA AAATAAATGT GACGTTTCCC
GACGTGCAGG TCATCGAAAT TCCGACGCAC TCTTGGATGC ACGGTTCAAG ATTCACTTTC
AAGTCGGTTC CGAGGACTTT GACTTCGATA TTGGAGTCAA TCGTTCACGC CAGGCTTTAT
CGTCAAATAT CGCCTGATTT TATCGTCATC AACTCATCCG TTATCCCAGC GCCAATGATA
GCTGCTGCGC TTTGTCGAAT ACCGTCGATA GTAATGGTTA GAGAATCTAT CAGGACAAAT
ACTCAACTAT TTTCTATAGT ACCCAAGAGC ATTTTAATAC GATTGATTGA AGGAATGTCT
ACCTTCCGCT TTGCGGTTTC ACATTACGTT GCGGACCAGT TGAATCAACC GTGCACGGTT
GATTTCCCAG ATGTTAGGCG CGACTTGGGC ATAGAGTCCC TTTGGCCCAC AGACAATGAG
GCGAAACCCA CTCGGGCACG AGCGCTTCGC GCCGTGATGC TGGGCTCGTT CTCGCCGGAA
AAGGGCCAAG ATGATGCCAT TCAGGCGGTG GCTTTAGCGC GGGCAGCCGG AGTTCAAATC
GACCTCTCAC TGTATGGCTA TGCGCACGAA AGCGAAATTT TAAAATTGCA GGAGTGGTGC
GACCGCCATG GTTTGAGTGA TCGAATCAGA CACAAAGGTT TCATCGATGA TCCTAAAGAG
GCCTACGGTT CGGCGGATGT GTCGCTGGTT TGTTCCAAAA ATGAGGCTTA CGGAAGGGTG
ACGGCGGAGT CGCTACTAAT GGGGGTACCC GTTGTGGGCT ACGAACTCGG TGGTACAACG
GAAATCCTTA GGGCTGGTGG CGGAATTTCC TGCAAACCGA CATCTACAGA CCTAGCAAAT
GTCCTAGTTT CATTAGCGGA AGACCCCAAC CTTCTGAATG ACCTCCATTC GCAGTGCCGG
TCCCTCCGCG CTGACAGCGG GGAATTTGGG AATTCGGGGA GAACTGTTTC GCGTATGGTG
GAAAAGATCA TCGGCGTTGG TGGCTAA
 
Protein sequence
MHVESLNLIE NRTRYSKRTD QTGGPGTSMK FVIASHSERL AGAERSLFAI VAAALEAGHE 
VVVTVPQQGA LQSKINVTFP DVQVIEIPTH SWMHGSRFTF KSVPRTLTSI LESIVHARLY
RQISPDFIVI NSSVIPAPMI AAALCRIPSI VMVRESIRTN TQLFSIVPKS ILIRLIEGMS
TFRFAVSHYV ADQLNQPCTV DFPDVRRDLG IESLWPTDNE AKPTRARALR AVMLGSFSPE
KGQDDAIQAV ALARAAGVQI DLSLYGYAHE SEILKLQEWC DRHGLSDRIR HKGFIDDPKE
AYGSADVSLV CSKNEAYGRV TAESLLMGVP VVGYELGGTT EILRAGGGIS CKPTSTDLAN
VLVSLAEDPN LLNDLHSQCR SLRADSGEFG NSGRTVSRMV EKIIGVGG