Gene Moth_1419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1419 
Symbol 
ID3832247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1463564 
End bp1464691 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content55% 
IMG OID637829355 
Productperiplasmic binding protein 
Protein accessionYP_430275 
Protein GI83590266 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.364808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.918075 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAGC ATTTTACCTG TAAATTGGTC AGCCTGCTCC TGGCCATCTT GATCTTGACT 
GCCGCCCTGG TGGGTTGCGG CTCGCAAGGG AGTGGCCAGA AGGCAACCGG CCAGGCTTCT
GCCGGTCAAG CAGCAGCTAA CAGGACTATA GTCGATATGG CCGGTCGCCA TATTACCGTC
CCGGTCGAGA TCAAGAAGGT ATTTGCCACC AGCCCGGTGG GAACCATTCT GGTTTACACC
CTGGCGCCGG AGAAACTGGC GGGATGGAAC TATGAGCTAA ATGAGGTAGA GAAGAAGTTT
ATCCTGCCCG AGTACCAGAA GCTACCCAAT CTGGGCGGCT GGTATGCCAA GAATACGGCC
AACATCGAAG AGATCCTGAG AATTCATCCT GAGGTTATCC TTTCTATGGG CTACATGGAT
AACACGGCCC GTTCCCAGGC TGACCAGATC CAGGAGCAGC TTAAAATACC GGTGGTGATG
GTTGACGGTG AACTGACAAA GCTGGACCAG GCTTATGAGT TTTTAGGCGA CCTGCTGGGA
GAGAAGCAAA GAGCCAAGGA ACTGGCGGCT TATTGCCGGG ACACTATCAA TGAAGCTGCC
GCTAAAGTTA AGGCGATGCC GGCGGACAGG AAGGTCCGAG TTTACTACGC CGAAGGTCCT
ACCGGGCTGC AAACGGATCC CGCTTCTTCC CAGCATACCC AGGTGCTGGA TTTTATCGGC
GGCATCAACG TGGCGGCCAT TCCACCCCAG CGCGGCCCAG GGGGCATGGG GATGAGCTCC
GTCTCTTTGG AACAGGTGCT ATCCTGGGAT CCCGATGTGA TTCTCTTCTG GAACGTAGCC
CAGGGAGGCG CCTACGAAAC TATCCTTAAA GACCCAAAAT GGCAGAACCT TAGAGCTGTG
AAAAGCCACC GCGTCTACCA GGTTCCCCAC GGGCCCTTCA ACTGGTTCGA CCGGCCGCCC
TCTGTCAACC GCATCATCGG GGTGAAGTGG CTGGCCAATC TCCTTTACCC GGATGTTTTT
AATTATGACC TGGTGGCAAC GGTCAAGGAT TTTTATGCCA GGTTCTACCA CTATAACTTA
TCCGATCAGG AAGCTGATAC CCTCCTGGCC GGGGCCAGGG GGAAATAG
 
Protein sequence
MFKHFTCKLV SLLLAILILT AALVGCGSQG SGQKATGQAS AGQAAANRTI VDMAGRHITV 
PVEIKKVFAT SPVGTILVYT LAPEKLAGWN YELNEVEKKF ILPEYQKLPN LGGWYAKNTA
NIEEILRIHP EVILSMGYMD NTARSQADQI QEQLKIPVVM VDGELTKLDQ AYEFLGDLLG
EKQRAKELAA YCRDTINEAA AKVKAMPADR KVRVYYAEGP TGLQTDPASS QHTQVLDFIG
GINVAAIPPQ RGPGGMGMSS VSLEQVLSWD PDVILFWNVA QGGAYETILK DPKWQNLRAV
KSHRVYQVPH GPFNWFDRPP SVNRIIGVKW LANLLYPDVF NYDLVATVKD FYARFYHYNL
SDQEADTLLA GARGK