Gene Tneu_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1002 
Symbol 
ID6164842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp894066 
End bp895220 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content58% 
IMG OID641668155 
Producthypothetical protein 
Protein accessionYP_001794380 
Protein GI171185461 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000137488 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00202904 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGGTTCCG CCGAGTGGCG GCGTAGAGAT AGGCGTTTGC TGGAGCTGGC GGCGCGTTGG 
TATGTAAAAC ACTACGTCGG GGGGAGAAAT ATAATGAAGC TCCTGGGCTT GTCGGAGGAT
GAGAGATATG GGGTGGCCTC TGCGTTGCTG GCCGAGTTTG TAAATGCGAT GACGGTAGTG
GTGAAGGCTG CGCTTAGGGA CCTCTTGGTT TATAGGGAAT TTGCTGTCGT ATACGGCGAC
GAGATCTTCG GCGCCTTAGA CGTGGGGAGG ACCGTGGCTG TTCTGCCTGC GGGGGTGTAC
GCCTCGTTGA CGTACCTCCC CTCGCTTCAA GCGCCTGAGT ATGGGATACT GCGGCGCATG
GCCTTGAAAG TATCGAAGTG GGGGAGGGAG GCCGCCGTCG ACGAGGAGAT GAGGAAGGAC
GCCGAGAGGC TGAGGCGGAT CGCCAAGAGG TTGCCTAAGG CATATGGCAG AGCGGCAGTG
GACGTAAGAG AGGGGCCGCC TTGGCTGAGA CAGGGCTGGG CTATCTACAG AGCCGCCAAG
GCGCTTGTGG AGGGGGAGGT ATACGTGGGG GAGAGGAGGG GCGTTGGAAA GGCGTTGAAG
TTCGTCAACT GGCGCCTATA CGAGATGTAC ATAGCTATGC TCGTGTTGGA GGCCCTACGG
CGCTTGGGTT GGAGGACGGT GGGGGTAGAC GCGGAGAAAC GCGCCGTCTT AGTCGAAAGG
GACGGTAAAA CGCTGGCGGT GTATCTTAAC AGAGCGTTGC CGCACCACTC CATAATAGAG
GAGGTCGCCG GGGACGAGGT GAGGGGGAGG CCGGATTTAA CTGTCGCAAA CGCCGATGTG
AAAGCCGTGG TGGAGTGCAA ATACTCAGAC AGGCCGGGCT ACATCGCAAG AGGCCGCTTC
CAAGTGATGA CATATATGTG TGAATATAGT GCGAAAATTG GGATATTGGT ATATCCAGCC
GCGTCGGAGG AAGAGGCCGA GGATGAAGAA GAGGAGGCGG CAGTTAGATG GGCAAACAGC
GGCAAGCCGA TCCGTATGAA GGACGGCCGC GCCATATACC CCCTGAGGAT AGACCCGGCT
TATGGAGCTA CCGCGGATGA GGCCAGGGAA AAACACATCG GCGAGGTGAT GAGGTTGCTG
GAGAGGTCAT TATAA
 
Protein sequence
MGSAEWRRRD RRLLELAARW YVKHYVGGRN IMKLLGLSED ERYGVASALL AEFVNAMTVV 
VKAALRDLLV YREFAVVYGD EIFGALDVGR TVAVLPAGVY ASLTYLPSLQ APEYGILRRM
ALKVSKWGRE AAVDEEMRKD AERLRRIAKR LPKAYGRAAV DVREGPPWLR QGWAIYRAAK
ALVEGEVYVG ERRGVGKALK FVNWRLYEMY IAMLVLEALR RLGWRTVGVD AEKRAVLVER
DGKTLAVYLN RALPHHSIIE EVAGDEVRGR PDLTVANADV KAVVECKYSD RPGYIARGRF
QVMTYMCEYS AKIGILVYPA ASEEEAEDEE EEAAVRWANS GKPIRMKDGR AIYPLRIDPA
YGATADEARE KHIGEVMRLL ERSL