Gene Arth_1239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1239 
Symbol 
ID4446268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1362956 
End bp1363939 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content66% 
IMG OID639689047 
Productglycosyl transferase family protein 
Protein accessionYP_830733 
Protein GI116669800 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.289253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGACG GCCGCGTAAC GGTGGTGGTC CGGACCAAGA ACCGCCCCGT CCTGCTCCAC 
CGCGCCCTGG AGGACATCCT GGGCCAGTCC TACCAGGACT TCTCCATCGT GGTGGTCAAC
GACGGCGGGG ATCCGGCTCC GGTCGACGCG CTGGCGGAAG GATACAGCCA CCTCCCGGCG
GGGAAGCTGA AGGTGCTCCA CCATGCACAG AGCAAAGGCA TGGAAGCAGC CAGCAACGCA
GGCATTGCCG CCGCAACGTC GGAGTACGTC GCCGTGCATG ACGACGACGA CCGGTGGCAC
CCGGATTTCC TGCTCAAGAC CGTTGGCTTG CTGGACGGGA AGCCCGCCGC GCACGGAGTT
GCCGTCAGGA CCAATGTTGT ATACGAGGAA GTCCGCGACG GCGAGATCGT GGAGACGGGC
TCCTTCGCGT ACTGGCCCGA GCTGCGGGCC ATCACGCTGA CGGACATGCT CCGGATCAAC
AGGATCGTCC CCATTTCCTT CCTGTACCGC CGCTCCGTGC ACGACCACGT GGGGTTCTAT
AACGAGGAAC TCGACGTCGT GGGGGACTGG GAGTTCTACC TGCGGTTCCT GCAGGCCTAT
CCGATGGAAC TGCTCGACGA CGAGCCGCTG GCCTTCTGGT GCCAGCGGCC GGCAGCGAGC
GGAGACATGG GTAACAGCGT TATCGCCGCG GCCGACGAAC ATGCGAAGTT CGACAGCCTC
GTGCGGGATG CGTTCCTGCG GCGCGAAGCC GGCAAGACTG GAGTTGGTTA CCTTCTCTAC
CTGGCCCAGC TGAGCGGGCA ACAGGAGGAA GCCGCTGCAG AGGCCCGGGC GCTGGCCGAC
CGGGTGGTAT CCACCCTGGA GGACCTCAGC AGGCGCATTT CGGTCCTGGA GGAGACAGTG
GTCCGGCGGA CCAGCGTCTT CGAGTTCGTC GGCCGCCCGG CCCGCGTTGC AGCGCGGCTC
TGGAAATCCC GCCGGAGGGA TTAA
 
Protein sequence
MADGRVTVVV RTKNRPVLLH RALEDILGQS YQDFSIVVVN DGGDPAPVDA LAEGYSHLPA 
GKLKVLHHAQ SKGMEAASNA GIAAATSEYV AVHDDDDRWH PDFLLKTVGL LDGKPAAHGV
AVRTNVVYEE VRDGEIVETG SFAYWPELRA ITLTDMLRIN RIVPISFLYR RSVHDHVGFY
NEELDVVGDW EFYLRFLQAY PMELLDDEPL AFWCQRPAAS GDMGNSVIAA ADEHAKFDSL
VRDAFLRREA GKTGVGYLLY LAQLSGQQEE AAAEARALAD RVVSTLEDLS RRISVLEETV
VRRTSVFEFV GRPARVAARL WKSRRRD