Gene Tneu_1675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1675 
Symbol 
ID6165248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1474348 
End bp1475568 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content64% 
IMG OID641668840 
ProductMoeA domain-containing protein 
Protein accessionYP_001795043 
Protein GI171186124 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.548066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.60016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTATC TCACGGGCTT AAACCCGGTT TCTAAGGTGG CCGACATCTT GCCCCAGGTA 
AAGAGGGTTG AGGAGGTGGA GAAGGTCGCC GCGTGGGACG CCGCCGGGCG AGTCGCCGCG
AGAGACGTGG TGGCTCCCCA CGACTACCCG CCGTATCCCA GAGCGGCCTA CGACGGATAT
GCGGTGAGCT CGGAGGAGAC GCCGGGGAGG TTCAAGCTCG TCGGGGCTGT TCTGGTGGGC
CAGTACAGGA GGGACCTCGT CCTCGGCCGG GGGGAGGCCG CCTACGTCAC AGTGGGGGCT
TTCCTCCCCG AGGGGGCCGA CGCAGTGGTT CCCGAGGAGG CGGCTAGGAA GGAGGGCGAG
CTGGTGGCGG TGGAGAGGAG GTTTGAGAAA TATGCCAACG TGGATCCGCC TGGCTCCTAC
GTGCGTAAGG GCACGGTGTT GCTGAGGCAG GGGACGGTCG TAACGCCTTT TGACGTAGTA
GGCCTTCTCG ACGTGGGGAT AACCTCCCTA TATGTCTATA GAAAGCTACG CCTTGGCATA
ATCGCGACGG GGGACGAGCT GGTGGCTCCG CCTATAGATC CAGAGGTCGC CGCCGAGCTT
GTGATGAGGG GGAGGGTGAT CGAATCCACG GCGTCTCTGG TCTCGTGGTA TATACAGACC
TACATGCCCT ATGTGGAGGT GCGGGAGAAG GCGGTGTTGG GGGATAGACA TGAGGAGGTC
CGCGCCGCCG TGGAGAGGTT TCTCCAGCAG TACGACGCCG TTGTAATCAC CGGGGGCGCG
GGGCCTAGCG AGATAGATCA CTTCTACAAG CTCGGCTTCG GAGGTCTGAG GGGCTTCCGC
ATGAAGCCGG GAAGGCCGAC CAGCGTGGCC GTAGTCGGCG GAAAGCCGGT CTTCGGCCTC
TCCGGCTACC CAATAAGCGC GCTCCACGGC GTGGTGAGGA TCGTGGAGCC GGTCCTCCGC
CACATGGCGA ACGTGGCTAG GGGGCTTGGC CACGGGTGGC ACTACGCGGT GATTACGCAG
GACGTGCAGG GGGAGATGGC CCAGATCGTG AGGGTGAGGC TTGAGGTGGG CGAGGGGGGA
CTACAGGCAA CGCCTATTAA GACGAGGCAC CACAGTTTTA CAGACCCAGA CGCGTGCGGG
GTGGCGCTGG TGCCCCCCGG AGGGGCGAAA AGGGGAGACG TGGTTCTGGT CCTCGCCTAT
CGGGACTTGA GGAGGGGCTA G
 
Protein sequence
MRYLTGLNPV SKVADILPQV KRVEEVEKVA AWDAAGRVAA RDVVAPHDYP PYPRAAYDGY 
AVSSEETPGR FKLVGAVLVG QYRRDLVLGR GEAAYVTVGA FLPEGADAVV PEEAARKEGE
LVAVERRFEK YANVDPPGSY VRKGTVLLRQ GTVVTPFDVV GLLDVGITSL YVYRKLRLGI
IATGDELVAP PIDPEVAAEL VMRGRVIEST ASLVSWYIQT YMPYVEVREK AVLGDRHEEV
RAAVERFLQQ YDAVVITGGA GPSEIDHFYK LGFGGLRGFR MKPGRPTSVA VVGGKPVFGL
SGYPISALHG VVRIVEPVLR HMANVARGLG HGWHYAVITQ DVQGEMAQIV RVRLEVGEGG
LQATPIKTRH HSFTDPDACG VALVPPGGAK RGDVVLVLAY RDLRRG