Gene Moth_1835 details


Gene Information

Locus tag: Moth_1835
Symbol:
ID: 3832804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1892074
End bp: 1893207
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 61%
IMG OID: 637829765
Product: hypothetical protein
Protein accession: YP_430678
Protein GI: 83590669
COG category:
COG ID:
TIGRFAM ID:
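
The listed gene and protein lengths follow from the start and end coordinates. The short Python sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon. Both assumptions are consistent with the values in this record, but the record does not state them. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: they are 1-based and inclusive.
START_BP = 1892074
END_BP = 1893207


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1134
print(protein_length(length_bp))  # 377
```

Both results match the Gene Length and Protein Length fields above.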


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000242425
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGTCCGGG TTTTATTCTT GCGCCGGCGG TGGATGAAGC TGGCGGGTTC AGGCGTCATC 
CTCTTGGCCG GCATGGTCTT TTTCCTGCTT AATCAGAAAG GCCTGCCGGT AATAGCGCCT
TCCGCCCCGG GGGAGTTGAC GGAGCACCTG ACAACCATTT TTACGGCCCG GGCCAGGGCC
CTGGTCAACG GTAATTATGA AGGGCTGGAG GCTTTTTATG ATGCCACGAC GACCAGCGGC
CGGTTTGCCC TGAACCATGA AATCGGCCGC ATTAAATACG TCCAGGAATG GTTGCAAAAA
CGCCAGGTAA CCTTGACTGG CAGTCACCTG GACCTGGCCG TTGTCGACAG CGGTAGCGAA
GGGGATAAGG GCTGGGCCTC GGTATCCCAG CATCTGATCC TCAGTTACCG GCACCAGGGG
GAGCCGCAAG AAACAGTCAA CCGGATGGGG TTTCGTACCC TCCACTGGGT GGAGCTGGTC
AAGCGGGACG GCCGCTGGCT GATCAACCGC GACTGGTACT GGGACCCTTT TGAAACCGAC
GACCTGAAAC CAGAAATCGC CCCCGGCACG GCTGTATGCA AGGCGCTGCC GCCGCCGGTA
AAGGGTAAAT ACCGCCGTGA GGCGGCGGTG GTCTATGCCG ACCGCTACAG CGGCGTGCGC
CTGGGTCCCG GGGACGGCCG CTACAACCGG AATTACCGTG ACTTTACCGG CCTGGGCGGC
GATTGTGCCA GCTTTGCCTC CCAGGTCTTG AGCGACAAAG AAGCCGGCGG CATACCCCGG
GACTGGGTTT GGAATTACCA TAACGGCGAG GGCAGCCAGG CCTGGGCCCA GGCAGCCGCC
CTGGTCTATT ACCTCCTGGA CAGCGGCCTG GCGGTGCGCC TGGCAAGAGG TGATTTCCAG
GAAGTAACCC GGTCCACTTC TAATTACCCC TACGGGGCGG TCAACGCCCT GCAACCGGGT
GACATCATCG GTTATGAAGA AGGGGGCGAG TTAAGCCATG TCTCGGTGGT TGTAGGCCGG
GACTCGGCCG GATATGTCCT GGTCGACAGC CATACGGCCG ACCGTTACCA TGTCCCCTGG
GACATGGGTT GGAAGAGCGG GACCATCTAC TGGCTCCTCC AGGTAGTCTA TTGA
 
Protein sequence
MVRVLFLRRR WMKLAGSGVI LLAGMVFFLL NQKGLPVIAP SAPGELTEHL TTIFTARARA 
LVNGNYEGLE AFYDATTTSG RFALNHEIGR IKYVQEWLQK RQVTLTGSHL DLAVVDSGSE
GDKGWASVSQ HLILSYRHQG EPQETVNRMG FRTLHWVELV KRDGRWLINR DWYWDPFETD
DLKPEIAPGT AVCKALPPPV KGKYRREAAV VYADRYSGVR LGPGDGRYNR NYRDFTGLGG
DCASFASQVL SDKEAGGIPR DWVWNYHNGE GSQAWAQAAA LVYYLLDSGL AVRLARGDFQ
EVTRSTSNYP YGAVNALQPG DIIGYEEGGE LSHVSVVVGR DSAGYVLVDS HTADRYHVPW
DMGWKSGTIY WLLQVVY
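
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before they are used elsewhere. The following is a minimal Python sketch of that cleanup, with a simple GC-fraction helper. The function names are hypothetical, and the example runs only on the first two blocks of the gene sequence, so it does not reproduce the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, as printed in the record.
first_blocks = clean_sequence("GTGGTCCGGG TTTTATTCTT")
print(first_blocks)        # GTGGTCCGGGTTTTATTCTT
print(len(first_blocks))   # 20
print(gc_fraction("GTGGTCCGGG"))  # 0.8
```

Applying `clean_sequence` to the full gene sequence should give a string whose length equals the listed gene length if the record was copied without loss.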