Gene Nther_2212 details


Gene Information

Locus tag: Nther_2212
Symbol:
ID: 6316592
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 2347769
End bp: 2348998
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 45%
IMG OID: 642644600
Product: molybdopterin binding domain
Protein accession: YP_001918366
Protein GI: 188586821
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0521] Molybdopterin biosynthesis enzymes
TIGRFAM ID:
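
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a stop codon. The function names are illustrative and not part of the database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 2347769
END_BP = 2348998


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1230 409
```

Both values agree with the Gene Length and Protein Length fields listed in the record.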


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000001586
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.00707587
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAATACG AGTATGTAAA AACTCGAGAA GCTGTTGGTA TGGTGATTCC CCATGATATA 
ACAGGAATCA AACCTGGTGA ATTCAAGGGA GCTAAATTTA AAAAGGGACA TGTCATTACC
GAACAGGATA TTCCGGTATT ACTTTCTTTA GGAAAAGAAC ATATCTATTC AATGAAATTA
GAAGACGGAG AGGTTCATGA AGACGAGGCC GGAACCCGCA TTGCCCAAGC TGTTAGCGGT
CACGGGATTG CCTTCAAGGG TCCGGGGGAA GGGAAGGTAG ATCTATTGGC TGAAGTCCCC
GGATTATTGA AAGTAAATGT ATCTTCTTTA ACTCGGATTA ATGCTTTGGA TAAAGTGATT
TTGGCTTCAT TACACAATAA TACTCCGGTA AAAGAGGATG GAAAACTTGC CGGTACCAGG
GTGATCCCCC TGGTAGTTTC AGAACAGGTA GTCCAAGAAG TAGAAGAGAT TTGCCGAGAA
GAGGGCCCTG TGATTTCAGT AAAACCCTTT GAAAAATTGG AGATGGGTGT TTTGGTAACA
GGTCAGGAAG TTTATCAGGG GCGGGTAGAA GATGCCTTCT ATCCCGTACT GAAAGAAAAA
GCCAGAGAGC ATGGTTTGAC AGATCCTCGG GTTTGCTACG CCCCTGACGA TCCTTGTGAA
ATCAGTACTA ATATTCAGGC CTTGATTGAT CAGGAGGTCG ATTTAGTAGT GGTTACAGGG
GGAATGTCCG TTGATCCCGA CGATGTTACT CCCAAGGGGA TTCGCATGAG TGGTGCCCAA
ACTATCAGTT ATGGAGCCCC CGTGCTTCCA GGTGCTATGT TTTTGATGGC TTATCATCCT
GGGAATCAGG CGAAAGAGGC CAAACTAGGA GAAAACAGAC CAAAGGCAAA GGATTCATTT
GATGTATCTA ATTCAGGTGA TGCAAAAGAT CCAGCTAATC CAAGTCACGA CAGGAAAGAA
GTCGGAGCTG ATAAAGTTAA CTTCAGGGAG AATCCCAGGG AAAATCCAAG GGAAAATCCC
AGGGAAGTTC CCAGAGAAAT TCCCGTAGTA GGCCTACCTG CCTGTGGCAT GTTCTTTCAA
ACTACCGTTT TCGATTTAAT TTTTCCCAGG TTGTTGGCAG GGGAACGGGT TAGTTCAGAG
GAAATCGCCG CTCTTGGTCA TGGCGGTCTC TGTCTTCAGT GTGAACAATG CAAGTATCCC
AACTGTCAAT TTGGAAAGGG GAGTGTATAA
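
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below, using hypothetical helper names, shows one way to do that and to compute a GC fraction. Only the first line of the listing is pasted in for brevity, so the printed percentage describes that line alone and is not expected to match the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing, copied from above.
first_line = "ATGAAATACG AGTATGTAAA AACTCGAGAA GCTGTTGGTA TGGTGATTCC CCATGATATA"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.1f}% GC")
```

Pasting the full listing into `clean_sequence` should give a string of 1230 bases if the listing is complete.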
 
Protein sequence
MKYEYVKTRE AVGMVIPHDI TGIKPGEFKG AKFKKGHVIT EQDIPVLLSL GKEHIYSMKL 
EDGEVHEDEA GTRIAQAVSG HGIAFKGPGE GKVDLLAEVP GLLKVNVSSL TRINALDKVI
LASLHNNTPV KEDGKLAGTR VIPLVVSEQV VQEVEEICRE EGPVISVKPF EKLEMGVLVT
GQEVYQGRVE DAFYPVLKEK AREHGLTDPR VCYAPDDPCE ISTNIQALID QEVDLVVVTG
GMSVDPDDVT PKGIRMSGAQ TISYGAPVLP GAMFLMAYHP GNQAKEAKLG ENRPKAKDSF
DVSNSGDAKD PANPSHDRKE VGADKVNFRE NPRENPRENP REVPREIPVV GLPACGMFFQ
TTVFDLIFPR LLAGERVSSE EIAALGHGGL CLQCEQCKYP NCQFGKGSV
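
The protein sequence follows from the gene sequence under translation table 11, as listed in the record. The following minimal sketch builds the codon table from the NCBI base ordering (T, C, A, G) and translates up to the first stop codon. It assumes the gene starts with ATG, as this one does, and does not handle the alternative start codons that table 11 allows. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
# Amino acids for elongation codons in translation table 11; "*" is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listing above.
print(translate("ATGAAATACG AGTATGTAAA AACTCGAGAA"))  # MKYEYVKTRE
```

The result matches the first ten residues of the protein sequence listing.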