Gene Tneu_0126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0126 
Symbol 
ID6165824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp112720 
End bp113967 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content67% 
IMG OID641667292 
Producthypothetical protein 
Protein accessionYP_001793529 
Protein GI171184610 
COG category[R] General function prediction only 
COG ID[COG1571] Predicted DNA-binding protein containing a Zn-ribbon domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0200734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00611159 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGGTCG TCGTCGGAAT CGACGACACG GACAGCCACA GGGGGGGATG CACGACGTAC 
GTCGGCTACC TGTTGGCGAA GGAGGTGCTG AGGCGGTGGG GGGCAGGCGC CTTCAGAGAC
TTCCCGCGTC TCGTGAGGCT TAACCCAAAC GTGCCCTTTA AGACGAGGGG CAACGCCGCC
GTGGCGCTGG ATCTGGAGAT ACCGGAGGGC GACGTGGAGG AGCTCTGGAG GCTTGCGGTG
GAGACGGTGG CGGCCCACTC AAGGCGGGAG GGGAAGACGG ACCCAGGTGT GGCCATGGCC
GCCGGCGGCG TGCCCGAGAG GGCCAAAACG CTGTACCGCA TGGCGCTGAC GCAGGTAGTG
AGCATAAGCG CGGCGGAAAG GGCGGGGGTC CTCACATGGG GCGGACGGGG GAAGATCGGG
GCGGTGGCCG CCGTCGGCGC CTACTTCCCC AAGTCCACCT TCGAGCTCAT CGCCTATAGG
CGGGGCGACA GGGAGGCCAT CCCGCCCGAC CTCGTGAGGC TTCTGGAAGC TCTGACGTAT
CCCTACACCT TCCACAACGT AGACAGGCGG CGGGTGCTGA TAGAGCCCAG GGGGCCTGAC
CCGGTCTACT ACGGCATTAG GGGGCTCACC CCACAACACC TCAGATACGC CCTATCTCTC
CTCGAGGCGT GGGGCTACAG ACCCGCCGGC TGGGTCATAT ATAGGACAAA CCAAGCCACG
GACGCCCACA TAGAGCTCGG GGTCTTCTAC GGCGACCCCC TCCCCTACTC CTTCTACAGA
GCCAGGGGGC TGGTGGTGGA GGCGCGGAGG GTAGCCGGGC GGCACCTAGT GGGGAGGCTA
GACAGCGGCC TCCGCTTCGT GGCCTACAGA CACTTGGGGC GGCTCGCCTC GGAGCTGGAG
AGGTGCCTCC GGTGCGACGT GGTTCTCTAC GGAGGGCTGA AGCCCAGGAG GGGAGGCCTC
TACCTATACG TGGAGAGGGC CTACGTGCTG GGCAGGTACA TCCCGGCAAG GAGCCGCTGC
ACCTACTGCG GGGGATCGCT AGAGAGCCTG GGGAGAGGCA GAGGCTGGAG GTGCAGACGG
TGCGGCGCCG TCTTCCACAG CGCCCCGATC CGCTGGCTCT ACGACACAGC TCCGCGGAGG
GCTCTCCTCC CCCGACCCGG CGAGTGGCGC CACCTCCTCA AGCCGCCCGA CGTGGATCCC
ACAATACCCA ACTTCTTCAG CCCCAGCTCC GCCGAGTGGA TCGGCTAG
 
Protein sequence
MRVVVGIDDT DSHRGGCTTY VGYLLAKEVL RRWGAGAFRD FPRLVRLNPN VPFKTRGNAA 
VALDLEIPEG DVEELWRLAV ETVAAHSRRE GKTDPGVAMA AGGVPERAKT LYRMALTQVV
SISAAERAGV LTWGGRGKIG AVAAVGAYFP KSTFELIAYR RGDREAIPPD LVRLLEALTY
PYTFHNVDRR RVLIEPRGPD PVYYGIRGLT PQHLRYALSL LEAWGYRPAG WVIYRTNQAT
DAHIELGVFY GDPLPYSFYR ARGLVVEARR VAGRHLVGRL DSGLRFVAYR HLGRLASELE
RCLRCDVVLY GGLKPRRGGL YLYVERAYVL GRYIPARSRC TYCGGSLESL GRGRGWRCRR
CGAVFHSAPI RWLYDTAPRR ALLPRPGEWR HLLKPPDVDP TIPNFFSPSS AEWIG