Gene Moth_0726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0726 
Symbol 
ID3831002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp757587 
End bp758879 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content56% 
IMG OID637828657 
ProductType III effector Hrp-dependent outers 
Protein accessionYP_429587 
Protein GI83589578 
COG category[S] Function unknown 
COG ID[COG3395] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00533743 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.660892 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAGA TATCCATCAT CGCCGACGAT TTAACCGGAG CCAACGATAC CGGCGTCCAG 
TTTTGCCAGC ACGGTTTCCG CACCATGGTT ATTATAGATG CTGCCAACGT AGAGCGGGTG
GGGCAGGATA AAGATGTCTG GGCGATCAAT ACCGACACCC GCCACCTGGC AGCACCTGAA
GCCTACCAGC GCGTTTATGA GATCACTTTA AAACTAAAAA AAGCTGCCAT CAGCCGGGTT
TACAAAAAGA TTGATTCCAC CCTGCGCGGC CACCCCGGCG CCGAGCTGGA GGCTGTGATG
GACGCCTGGC AGGCGGACCT CGCCCTGGTG GTGCCGGCCT ACCCGGCCAA CCGGCGGTTA
GTGGTTGACG GCCACCTGTT GATAAGCGAG GGCATGGAGA CGGCCGCGGC TTCCGTAAGC
CTTACTCCTG GCGATGCCAG GGCAGCCCTT TGCCACATCC CTACCGTCCT GCAGGGGGAG
ATGGGCCGTC GGGTAGGCCA GATTAACCTG GCGACTGTGC GCCAGGGAGT GAAAGAACTG
GTAGCTGCCC TGGAGGCCGC TCGTACAAAC AGCCAGGTGC TGGTCCTTGA TGCCGCCGAC
GAAGAGGACC TAAGGAATAT CGCCCGGGCA ATCAGCCGCT TCCAGCGGGA TGTCATTGTG
GCCGGCGCCG CCGGCATGGC CGCCCATTTA CCTCTGGCCT GGAACCTAAA ACCAGTGCCT
AATAATCCAT TAAATAAAAA GGGGGCTATT CTCCTGGTTG CCGGCTCGCG TAACCCGGTC
ACTGCCGCCC AGGTGCAACG CCTGGCTGAG GTTAGCGCGT GTCAGGCTGT AAAGGTAGAG
ACGGAAGCTA TACTTACCGG AGAACCGGCT GTTGAAATAG AAAGGGTGTT GCAGGAAGTT
ACAACTCAAG ATGCAGGCGC AGGTTTAATT ATTATAGCCG TAGATAGCCT TTTCCAGACA
ATTGACAGAG ATAGGGTTTC CAACTCAGGA AGCAAAGCTA TAGCTTTAGC CCTTGGCACT
ATCACCAGCC GCCTCTTAAA TATGCGAAGG ATAAGTGCCC TGGTAGTTAC TGGCGGAGAT
ACTGCCGTTC ACGTTTGCCG GGCTCTGGAA GCCAGAGGAA TTAACCTGGC GGCCGATCTG
TTGCCGGGTA TCCCTTTGGG GTACCTGGAA GGGGGGCGGG GTGATGGACT ACCAATCGTT
ACTAAAGCCG GCGGTTTTGG TTCCCCCGAT TCCCTGATCA AAGTAAATGA ATTTCTTCAA
CAGAGAATGA AAAGTGAAAT GGAGTTGGTA TGA
 
Protein sequence
MEQISIIADD LTGANDTGVQ FCQHGFRTMV IIDAANVERV GQDKDVWAIN TDTRHLAAPE 
AYQRVYEITL KLKKAAISRV YKKIDSTLRG HPGAELEAVM DAWQADLALV VPAYPANRRL
VVDGHLLISE GMETAAASVS LTPGDARAAL CHIPTVLQGE MGRRVGQINL ATVRQGVKEL
VAALEAARTN SQVLVLDAAD EEDLRNIARA ISRFQRDVIV AGAAGMAAHL PLAWNLKPVP
NNPLNKKGAI LLVAGSRNPV TAAQVQRLAE VSACQAVKVE TEAILTGEPA VEIERVLQEV
TTQDAGAGLI IIAVDSLFQT IDRDRVSNSG SKAIALALGT ITSRLLNMRR ISALVVTGGD
TAVHVCRALE ARGINLAADL LPGIPLGYLE GGRGDGLPIV TKAGGFGSPD SLIKVNEFLQ
QRMKSEMELV