Gene Moth_1383 details

Gene Information

Locus tag: Moth_1383
Symbol:
ID: 3831630
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1429084
End bp: 1430235
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 65%
IMG OID: 637829319
Product: 4Fe-4S ferredoxin, iron-sulfur binding
Protein accession: YP_430239
Protein GI: 83590230
COG category: [R] General function prediction only
COG ID: [COG2768] Uncharacterized Fe-S center protein
TIGRFAM ID:
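
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no residue; both assumptions are consistent with the reported 1152 bp and 383 aa. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(1429084, 1430235)
print(length)                     # 1152
print(protein_length_aa(length))  # 383
```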


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.581713
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGGAAAC TGGATCTGGA GACTATCGCC GCTACCCTGA CGGGATTGGG ACAGGTACGA 
GTGCAGGGAG ATGCCTGTAT CAGGGGGAAG TCACCCCGGG TAACTTGCCG GCGGTGCCAG
GAAGTCTGCC CGGTAAAGGG TGTTGACCTG GGTAACGACC GGCCCGGGAT AAAGGATTGC
CAGCGTTGTG GCCTCTGCGC TGTAGCCTGT CCTGTGGGGG CCCTGGAGGA TCCAGAGCGG
ACCCACTCCT TTTTCCTGGC CCGGGGGCGG GAGAGTATAG TCGCCACCGG CAAAGCCCTC
TTTGCCTGCA ACCGGGGACT GGCAGACCAC CGGCGGGATG GCTGGATAAT AGCTTCCTGC
CTGGGGGCCG TCGCTCCGGA GGTAATCCTC GCCCTGGCTG TCAGGGGGCA AGTAGGTTTT
CGCTACCTCC CGGAAGAGTG TGCCGGCTGC CCCTGGGGGG ACAAGGGAGA GCGACTCTTC
CGCTCTTCTT TCGCCTGGGC CCAGCAGGCC CTGGGGGCTA TGGGTTTGCC CGGGGAGCGC
CTGATCCGGG GAGGGTATCT CAAGCCAGCC CCGGCTCATG GTGGTGCAAC TGGCAGGGCC
GGTGGCCCGG TCCCGGCGGT CATGGGCCGA CGCGAATTCT TCCGCTCCCT GGTATGCAAG
ATCAAAATTC CTGGAGTAGA AATTACCCCG CTCTCCCAAT CTCCCCAGGC TGTGAATGCC
AGGTCACGGG CCCTTATCCT GCAGCAGGCC CTGGAGGAGG CCAGGCCGGC AGGGGGTTAC
CCGGCAACGG CCCGCTTGCC CCTGGCTGCC CTGAAAGTAA CCGGTCCCTG TTACCTCTGC
AATATCTGCA GCCGGCTGTG CCCGACCGGG GCCCTGGAGT TGACGGAAGG GGAGTTGAGG
TTTAACCCAT CCCGCTGCAA CCACTGCGGC CTTTGCCTGG CGGTATGCCC CCAGCACAGC
CTGGCCTGGG GAGAGGACCT GCCTCTGGAG GCCATGGCAG CCGGGGCAAC CTGCACCCTG
GCTATCGTCA CAAATCACCG GTGCGCCAGC TGTGGAGAAA CCTTCCAGGC CGGCGCTACA
GCAATGGAAT GCCTGCGCTG CACCTTAAGC CGTGAGCTCC CCGGCGTGGC AGCCAGAAGG
GGCGGGGCTT AA
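
The GC content field of a record like this can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is the share of G and C bases among all bases; how the source page rounds the value is not stated, so no rounding is applied here. `gc_percent` is a hypothetical helper name. The usage line runs it on the first ten-base group of the gene sequence only, not on the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First ten-base group of the gene sequence above (6 of 10 bases are G or C).
print(gc_percent("GTGGGGAAAC"))  # 60.0
```

Pasting the whole gene sequence block into `gc_percent` gives the value to compare with the reported GC content.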
 
Protein sequence
MGKLDLETIA ATLTGLGQVR VQGDACIRGK SPRVTCRRCQ EVCPVKGVDL GNDRPGIKDC 
QRCGLCAVAC PVGALEDPER THSFFLARGR ESIVATGKAL FACNRGLADH RRDGWIIASC
LGAVAPEVIL ALAVRGQVGF RYLPEECAGC PWGDKGERLF RSSFAWAQQA LGAMGLPGER
LIRGGYLKPA PAHGGATGRA GGPVPAVMGR REFFRSLVCK IKIPGVEITP LSQSPQAVNA
RSRALILQQA LEEARPAGGY PATARLPLAA LKVTGPCYLC NICSRLCPTG ALELTEGELR
FNPSRCNHCG LCLAVCPQHS LAWGEDLPLE AMAAGATCTL AIVTNHRCAS CGETFQAGAT
AMECLRCTLS RELPGVAARR GGA