Gene Moth_1424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1424 
Symbol 
ID3832252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1468667 
End bp1469719 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content58% 
IMG OID637829360 
Producttransport system permease protein 
Protein accessionYP_430280 
Protein GI83590271 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0609] ABC-type Fe3+-siderophore transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.320274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.156715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGATTGG CGCTGGCCAG GAAAACTTCG CCTGATATAC CCCTGGCCAG GGGGTACCGC 
TGGAGAGAAA TCGCTTTGCC GGCCCTGCCG CTGCTGGTAT TCCTGCTCTC TTTTCCTTTG
GGCCGTTATA CCATATCCCC CGGGCAACTC CTAACCATCC TGGCGGCAAA GGTATTTCCC
ATTGAACCCA CCTGGCCGGC CACCATGGAA ACAGTAGTCT TCCAGGTGCG TCTGCCGCGC
ATTATCGCTG CCATGCTGGT TGGGGCCGCC CTGGCCACCG CCGGCGCTGC CTACCAGGGA
ATGTTTAAGA ATCCCCTGGT TTCGCCGGAC ATCCTGGGGG CCTCGGCCGG GGCCGGTTTC
GGCGCTGCCC TGGCTATCTA TTTCTCCCTG GGTGTAGTCG GTATCCAGGT CAGTTCCTTT
CTATTCGGCC TCCTGGCCGT ATTTCTGGCT TACGCCTTAA GCAGCCGGAT CCGCCACGAC
CCTGCCCTGG TCCTGGTTCT GGCCGGCATC CTCACCGGTA CCCTGTTTTC TGCCGGTACT
TCCTTAATCA AGTACCTGGC CGACCCCTAC GATAAATTAC CAGCCATTAC TTTTTGGCTC
ATGGGCAGCC TGGCGTCCAT TTCTCCACGG GACGTTTATG CGGCCCTGGT GCCTGTGTTG
CTGGGAATAA TACCCCTTTA CCTCCTGCGC TGGCGCCTTA ACGTCCTCTC TCTGGGGGAA
GAGGAAGCCC GGGCACTGGG CCTGGAAACC GCCAGGTTAA GGTTGATTGT GATCCTGTGC
TCCACCCTGA TGACCGCAGC CTGCGTCTCC ATCAGCGGCA TGATTGGGTG GGTGGGGTTG
CTGGTTCCCC ATCTGGCCCG CATGGTGGTA GGGCCCAATT ATAAGGTTTT ATTGCCAGCC
ACCATCCTGA TGGGCAGCGC CTACCTGCTG TTGGTTGACG ACCTGGCGCG GGTGCTGGCA
ACGGTGGAAA TCCCCCTAGG TATCCTTACT GCTTTGATTG GCGCTCCCTT TTTTCTGTAT
TTATTACAGC ATACCCGGAG GGGATGGTTA TGA
 
Protein sequence
MRLALARKTS PDIPLARGYR WREIALPALP LLVFLLSFPL GRYTISPGQL LTILAAKVFP 
IEPTWPATME TVVFQVRLPR IIAAMLVGAA LATAGAAYQG MFKNPLVSPD ILGASAGAGF
GAALAIYFSL GVVGIQVSSF LFGLLAVFLA YALSSRIRHD PALVLVLAGI LTGTLFSAGT
SLIKYLADPY DKLPAITFWL MGSLASISPR DVYAALVPVL LGIIPLYLLR WRLNVLSLGE
EEARALGLET ARLRLIVILC STLMTAACVS ISGMIGWVGL LVPHLARMVV GPNYKVLLPA
TILMGSAYLL LVDDLARVLA TVEIPLGILT ALIGAPFFLY LLQHTRRGWL