Gene Moth_1422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1422 
Symbol 
ID3832250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1466771 
End bp1467772 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content44% 
IMG OID637829358 
Productperiplasmic binding protein 
Protein accessionYP_430278 
Protein GI83590269 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.158314 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATAGAG CTATCAACAA AAAGTTGGTA ACTTTTTCGT TGATCAGTAT GCTGCTTTTA 
ATGACGTCTT TTATACTTGC CGGTTGCGGT AACCAGCAAA ACAAACCGCC GGCTGCAACG
GAAAAAACGG TTATGGATAT GGCCGGTAAA AACGTAAAGC TACCGGCTTC CATAGACAGG
GTGATTGTGA CCTGTTATGG CGGTGCCAGT CACGAGCTGG TAGTTCTGGG CGCCGGGGAT
AAAATCGTTG CCCAGCCATC CATGAAAAGA TTTCCTCAAC TCGTGAAAAT GATGCCCCGC
TTTAAAGATT TGCCGGATCC AGGTATTTTT GACAATGTCA ATATTGAGGC CATCTTAAAA
TTAAAACCCG ATCTGGTTGT GGCCAGTGTA ACTTCAACGA AAGGCAATCA AAAAATTGAA
GAAGCCGGTA TCCCGGTGAT TACCGTTAGT ACGGGCGTAG CGGATATTGA AGCTTTAAAA
AAAGAGTTCC GGATGATGGG GGAGGTCTTG AATAAATCTA ACGAGGCCAA TGCGCTGGTA
TCGTACTGGG ACAACTGGTT GAAGACCATC AAAGAGCGGG TGTCTAAGAT ACCTGAAGCA
AAAAGAAAAA GAGTTTATTA CATGCTGGGA GCACCGCTTC ATACCAACGG CAGTGCCTGG
TGGGGTCAAA CTTTAATTAC CGCTGCTGGT GGCCTCAATG TAGCCAGTGA GATTGGTAAA
GGTAGAGATA TTAATATTGA ACAGCTTTTA ACATGGAACC CGGATGTAAT CATCATCAGT
AGCAATGAAG GCCGCTTTAT TCCTATATCT GAAGTAAAAA ACAACCCTCA ATTCAAGGAT
TTGCAGGCTG TAAAGGAGGG CCAGGCCCAA ATCCTTTACC CCGAAAACTT TCGTGATGTA
GATTTGACTC AGGAAACGAT TAAGTTCTAC CAAACTTTTT ACCACTACAA CCTGACGGAA
CAAGACGTTA AGGAATTTTT CAATCCCGGT CCTTTGCAAT AA
 
Protein sequence
MYRAINKKLV TFSLISMLLL MTSFILAGCG NQQNKPPAAT EKTVMDMAGK NVKLPASIDR 
VIVTCYGGAS HELVVLGAGD KIVAQPSMKR FPQLVKMMPR FKDLPDPGIF DNVNIEAILK
LKPDLVVASV TSTKGNQKIE EAGIPVITVS TGVADIEALK KEFRMMGEVL NKSNEANALV
SYWDNWLKTI KERVSKIPEA KRKRVYYMLG APLHTNGSAW WGQTLITAAG GLNVASEIGK
GRDINIEQLL TWNPDVIIIS SNEGRFIPIS EVKNNPQFKD LQAVKEGQAQ ILYPENFRDV
DLTQETIKFY QTFYHYNLTE QDVKEFFNPG PLQ