Gene Arth_2693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2693 
Symbol 
ID4444781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3020678 
End bp3022693 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content62% 
IMG OID639690513 
Productglycosyl transferase family protein 
Protein accessionYP_832172 
Protein GI116671239 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.316585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGTCT CAACCGAAAG CCCGAAGGGC ACCGACAGCC AGGAGTCCAG TGAACAGCGG 
TGGGACACGC TCCAGCGCGT AATCCTGCCG AGTTCCAGCC AGATGGACAC GGTCCCGTTG
TACATGGACA TGGGAACGGC CACGGGTGTC CAGCTGCCCA CGGTCGGGGA TCGTGACGGC
AAGGCAAGCA AGCCGCAGGC CTTCAGCAGT CCCACGAAGG AAGCCCACGT GGAGGACTTC
CTGTCCCGTT TCTCAACGTC CGTGCGGTCC GGGGAACGTG TCTCCTTCGG CAGTTACTTC
AACGCTTTTC CTGCCAGCTA CTGGCGGCGT TGGACCAATG TCGAGAAAAT CCGGCTTCAT
GTCCGCACCC AGGGTGCCGG GTCCGTCATC GTCTATAAGT CCAACGCCCG CGGATCACTC
CAGCGTGTGG ACACCCGCAG GGTTGAGGGA ATCGCGGAGA ACTTCTTCGA TCTATCCCTG
GCACCGTTTG GTGACGGCGG CTGGTACTGG TTCGACCTCG TGGCCGGCTC GGAACCCCTC
GTCATGCTGG ACGCGGAATG GCAGGGTCCT GCAGCGGACA CCCAGCCTGG TTCGGTGACG
CTGCAGATCA CCACCCTGAA CAAGACTGAT TTCTGCCTCA ACAACCTGCG GCTCCTCGCT
GAGAACGCCG AGGCGCTGGA GCACGTCAAG GAAATCCTGA TCGTGGACCA GGGTTCGCAG
AAGGTTGCGG AAGCGGAAGG CTTCGCGGAG GTCCGTGACT CCCTGCAGGG CAAGCTGCGG
ATCATCAACC AGTCCAACCT CGGCGGCTCG GGCGGTTTTG CGCGCGGCAT GTTCGAAGCC
GTGGAAAACG GCAGCGATTA CGTGCTGCTA ATGGACGACG ACATTGTCGT GGAACCGGAA
AGCATCATCC GCCTGCTGAC GTTTGCGGAC CGCTGCAAGA CGCCGACCAT CGTCGGCGGA
CACATGTTCG ACCTGTACAA CCGGACCGTG CTGCATACTT TTGGCGAGAT TGTGAACCCC
TACCGGTTCC AGCCGTCGCT ACAGAGCGAA GAGATGATCC TCGGGCACGA TTTCATGTCT
TCAAACCTCC GGCAGACGTC CTGGCTGCAC CGCCGCTGCG ATGTGGACTA CAACGGATGG
TGGATGTGCC TCATTCCCAC GAAGGTGATT CGCGAAATCG GGCTTTCACT CCCGCTGTTC
ATCAAATGGG ACGACTCCGA ATACGGTCTT CGGGCGAAGG CCCACGGCTT CCCAACGGTC
TCGCTGCCCG GCTCCGCAGT CTGGCACGTG TCCTGGATCG ATAAGGACGA CCTTGTGGGC
TGGCAGGCGT ACTTCCATGC ACGCAACCGT GTCATTGCTG CGCTGCTGCA CAGCCCCTAT
GAACATGGCG GACGCGTGGT CCGGGAATCC CAGTACATCG ACGTCAAGCA CTTGGTATCG
ATGCAGTACG CCACAGCGCA CGGCCGCGGC TGGGCGCTCG AAGACATCCT GAAGGGCCCG
GAGGCCCTGC GGGAGCTGCT CCCGTCCAAG CTGCCGCAGA TCCGGGAAAT GATGTCGGGT
TACTCGGACT CCGTCGTGCG CCCGGACCCG GATGACTTCC CTGCACCGAA GATGGACAAG
CCCCCGCGCC GGGGTCACGG AATCTCGCAG CCGTCCAAGG TATCGCTGCT GCCGTGGGCC
GCCAAGACTG TCATCCGGCA GCTTGCCGCT CCGGTGAGCG GTTCAAGCGC GGAGCGGCCG
CAGGCCACCG TGGCCCACCA GGACAACCGC TGGTGGCGGA TGGCTCAGTA CGACAGCGCA
ATAGTGTCCA ACGCTGAAGG AACGGGCGCA TCGTGGTACC GGCGGGATCC GAAACAGCTT
CGAACGATGC TGGCTGAAAG CGCGCGCCTC CACTCCCAGC TCCTGCAGAA CTGGCCGGCA
CTCAGCAAGA AGTACAAGGC CGCAATGAAC GACCTCACGT CGATTGAGTC CTGGAGGAAA
ACGTTCGAGC AGCACACTCA GAACGAGATC AAGTGA
 
Protein sequence
MSVSTESPKG TDSQESSEQR WDTLQRVILP SSSQMDTVPL YMDMGTATGV QLPTVGDRDG 
KASKPQAFSS PTKEAHVEDF LSRFSTSVRS GERVSFGSYF NAFPASYWRR WTNVEKIRLH
VRTQGAGSVI VYKSNARGSL QRVDTRRVEG IAENFFDLSL APFGDGGWYW FDLVAGSEPL
VMLDAEWQGP AADTQPGSVT LQITTLNKTD FCLNNLRLLA ENAEALEHVK EILIVDQGSQ
KVAEAEGFAE VRDSLQGKLR IINQSNLGGS GGFARGMFEA VENGSDYVLL MDDDIVVEPE
SIIRLLTFAD RCKTPTIVGG HMFDLYNRTV LHTFGEIVNP YRFQPSLQSE EMILGHDFMS
SNLRQTSWLH RRCDVDYNGW WMCLIPTKVI REIGLSLPLF IKWDDSEYGL RAKAHGFPTV
SLPGSAVWHV SWIDKDDLVG WQAYFHARNR VIAALLHSPY EHGGRVVRES QYIDVKHLVS
MQYATAHGRG WALEDILKGP EALRELLPSK LPQIREMMSG YSDSVVRPDP DDFPAPKMDK
PPRRGHGISQ PSKVSLLPWA AKTVIRQLAA PVSGSSAERP QATVAHQDNR WWRMAQYDSA
IVSNAEGTGA SWYRRDPKQL RTMLAESARL HSQLLQNWPA LSKKYKAAMN DLTSIESWRK
TFEQHTQNEI K