Gene Moth_2358 details


Gene Information

Locus tag: Moth_2358
Symbol:
ID: 3832538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2478170
End bp: 2480179
Gene Length: 2010 bp
Protein Length: 669 aa
Translation table: 11
GC content: 55%
IMG OID: 637830278
Product: glycosyl transferase family protein
Protein accession: YP_431184
Protein GI: 83591175
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
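
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 2478170
end_bp = 2480179

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the final stop codon not counted.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2010
print(protein_length)  # 669
```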


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00895001
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATAATGA GACCGAAATC CGATAAACGT TACGATATGA TCCTGGCGGG TATCGCCCTA 
TTATCGGCTT TCCTCAACAT CTTCAACATC TGGGAAGATC GCTACGCCAA CGCCTACTAC
ACGGCAGCGG TTACCAGCAT GCTGCAGAGC TTCCACAATT TTTTCTACGC TTCTTTTGAT
CCGGCCGGAT ATGTTACGGT GGATAAACCG CCGCTGGCGC TCTGGATCCA GACGATATTC
GCCTATTTCT TCGGGGTGCA CGGATGGAGC GTAGTTCTCC CCCAGGCCCT GGCCGGCGTC
GGTTCCGTCT TGCTGATATA TCACCTGGTC AAGCCGACCT TCGGGAAAAC GGCAGCCCGG
ATCGCGAGCC TGGTTATGGC CTGTACGCCA ATCGCCGTAG CGGTGAGCCG GACGAACAAT
ATTGACAGCT TGCTGGTATT TGCGCTCCTT ATAGCAACCT GGATGTTGTT CCGGGCGGTC
CGTGATCAGA AGCCGGTCTG GGTTCTCGGA GCCTTTGCCA TGATCGGTGT CGGCTTTAAT
ATAAAGATGC TTCAAGCTTA CATGGTGGTA CCGGCCTTTT ACGTTTTTTA CCTGCTAGCC
TTCAAGAACG AATGGAAGAA GAAACTGGCC CTCCTGACGG CGGCCACAGT GATCATGGTA
GGAGTTTCTA TATCCTGGGC AGCCGTCGTA GATATGACAC CCCAGGAAAA CCGGCCGTAC
ATCGGGGGCA GTAAAACGAA CTCCGTATTA GAGCTGGCCT TGAGCTATAA CGGGATCTCC
CGCCTCACGG GAATGAACCG GGGGGGCATG GGTCCCGGGA CAGCGCCAGG CCAGCGACAA
ATACAGCAGG ACCGGGGATC CTGGTCGGTA CAACAGCAAA TGCTCCAAGA TGGCAACAAT
GTACCCAACC GGCAGCAAAT GCCGCCGGAA GGCTTCGGCC CGGATGGTAA TGGAAATCCT
GGGCCGGTGC CCGGCCCAGG CGGGGGCGGG CCTCAAGGAA GAGGGGCCGG CGGTGCCTTC
GGTACCGGCC AGCCGGGTCC CCTGCGGCTG TTTCAAAGTG AACTCTCCGG ACAGATCAGT
TGGTTACTTC CCTTTGTAGC GTTCGCCTGC ATTGGCTTGC TGGCAGGCCT GCGCCCCGGA
AAGCCGCTGC CTGATAAACA GAAGGAGGCC CTGTTCTGGC TGGCCTGGTT GCTCCCGGCG
ATGGCGTTTT TCAGCGTAGC GGGCTTCTTC CATCACTATT ATCTGATCAT GCTCGCCCCC
CCGATTGCGG CCCTCACGGG AGCAGGCTGG GTAGAGCTCT GGAATCAATA CCGGGACAAG
GAAAGCTGGA AGAGGTGGCT CTTGCCGGCG GGTCTCCTGG CTACTACAGT TTTTGAACTC
TATATACTTA AGCCCTACCG GAACCAAATC GGCATGGGGT GGTCCATTGG CATCGGAGCG
GCAGGAATCG GGTTGGCGCT TGTACTGTTC CTTGCGGTAA ATAAACAAAA GCTGGCTTCA
AAGGTCGCCA TGGCCGGTAT GCTGGTATTA CTTGTGGCAC CATTGTACTG GGCAGCCACC
CCTATCCTGT ATGGCGAAAA TAGCATGCTG CCGCAGGCCG GTCCTAATCG CCAGGGATTC
GGTCCGGGAA GGATTACCCG GGGCGGGATG AACTCCGGTA TCAATACAAA ATTACTCGAA
TACCTGACCT TGAACAATAC AGGAGAAAAA TACCTTTTTG CCACCACCGA TGCCAATACG
GCCGCACCGT ATATCATTGA AACCGGAAAA GCCGTCATGG CCATGGGCGG GTTCAGTGGT
TCCGACCCTA TTCTTACAGT CGAGAAATTA AAGCAGATGG TTGCGAACAA AGAAGTAAAA
TTCTTCCTGA TTCCGTCACG GTCCGGTTTC GGAGGCGGAC CAGGCGGCAA TAACGAAGTG
CTGGATTGGA TCCGCGCCAA CAGCACGGAA GTTCCTATAG AAGAGTGGCA GTCCGACGCT
CCTCAGACAT TATATAAAAT CAACGACTGA
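
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here in space-separated blocks of ten bases; `gc_percent` is a hypothetical helper name, and the example call uses only the first line of the sequence, so its result is not the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only; paste the full sequence to
# compare against the GC content reported in the record.
first_line = "ATGATAATGA GACCGAAATC CGATAAACGT TACGATATGA TCCTGGCGGG TATCGCCCTA"
print(f"{gc_percent(first_line):.1f}%")
```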
 
Protein sequence
MIMRPKSDKR YDMILAGIAL LSAFLNIFNI WEDRYANAYY TAAVTSMLQS FHNFFYASFD 
PAGYVTVDKP PLALWIQTIF AYFFGVHGWS VVLPQALAGV GSVLLIYHLV KPTFGKTAAR
IASLVMACTP IAVAVSRTNN IDSLLVFALL IATWMLFRAV RDQKPVWVLG AFAMIGVGFN
IKMLQAYMVV PAFYVFYLLA FKNEWKKKLA LLTAATVIMV GVSISWAAVV DMTPQENRPY
IGGSKTNSVL ELALSYNGIS RLTGMNRGGM GPGTAPGQRQ IQQDRGSWSV QQQMLQDGNN
VPNRQQMPPE GFGPDGNGNP GPVPGPGGGG PQGRGAGGAF GTGQPGPLRL FQSELSGQIS
WLLPFVAFAC IGLLAGLRPG KPLPDKQKEA LFWLAWLLPA MAFFSVAGFF HHYYLIMLAP
PIAALTGAGW VELWNQYRDK ESWKRWLLPA GLLATTVFEL YILKPYRNQI GMGWSIGIGA
AGIGLALVLF LAVNKQKLAS KVAMAGMLVL LVAPLYWAAT PILYGENSML PQAGPNRQGF
GPGRITRGGM NSGINTKLLE YLTLNNTGEK YLFATTDANT AAPYIIETGK AVMAMGGFSG
SDPILTVEKL KQMVANKEVK FFLIPSRSGF GGGPGGNNEV LDWIRANSTE VPIEEWQSDA
PQTLYKIND
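
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below illustrates this on the first and last codons only. It builds the codon table in the conventional NCBI base order (T, C, A, G); the amino acid assignments of table 11 are the same as those of the standard code, and the special handling of alternative start codons is not modeled here, which does not matter for this gene because it begins with ATG. The function name `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Codon table in NCBI order: first, second, third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGATAATGA GACCGAAATC CGATAAACGT"))  # MIMRPKSDKR
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate("AAAATCAACGACTGA"))                   # KIND
```

Both results match the first ten and last four residues of the protein sequence listed above.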