Gene Tneu_1006 details


Gene Information

Locus tag: Tneu_1006
Symbol:
ID: 6164939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 897878
End bp: 899155
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 65%
IMG OID: 641668159
Product: hypothetical protein
Protein accession: YP_001794384
Protein GI: 171185465
COG category:
COG ID:
TIGRFAM ID:
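
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. Neither convention is stated in the record itself, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 897878
END_BP = 899155
GENE_LENGTH_BP = 1278
PROTEIN_LENGTH_AA = 425


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, ASSUMING one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1278
print(coding_codons(GENE_LENGTH_BP))   # 425
```

Under these assumptions both derived values agree with the listed gene length (1278 bp) and protein length (425 aa).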


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.260042
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0010639
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACCCAA AGGTAGCCGC GTTCCTCCTA GCGGCGGCCC TCGCCGCCGC CCAACAGCTG 
TACGTCCCAG TTTTGGTAGA CGTGTCACAC GGCGAAGCCA CGAAGGGGCT CGACCTCTGG
GTAAACTCCA CGGCAAACCC CCTGGCGATT ACGGACTTCG CTAGGCTGTA CGTCCTCGTC
CCGCCAGACG CCAAGCTCGA CCCCACCCTG ACTAAGCTAA ACGCCACCAA GGCGGCCGTC
ATAATACGGG GAGACCTCTC GACGGTCGAC CTGTCCCAGT TTAAGGTAAT CGTCCTCGGG
CAGCCGCCTA AGCCCCTCAG CGAGGCGGAG CTAGCCGCCT TAAAGAAGTG GTTCGACTCC
GGCGGGAGAG TGCTGTGGTG CGCCGCCGAC TCCGACTACC CAGCCCAGGG CTCGGAGGAG
TCGCAAGCCG CCTGCAACGA CGTAGCTGAG TACCTCGGCG CCCACATCAG AGCCGACTAC
GTCTCGGTGG AGGACCCCCA GCAGAACGCG GGGGCTGCCT ACAGGGTGGT AGGCGTCGTT
GACCCGCCGC CTCAGCTGGC CTTCCTCGGC TTCCTCGCCC AGCGCGTCCT CCTCCACGGC
CCCGGGGCAA TCGCGGTGGT GCTGCCCGAC GGGAGGTGGG TCCCCGCCAC AAGCCCAGAG
GCGCAGAAGG CCTACGGCAA CATATATGTG ATAGTGAGAA CCACGGAGAA GGGGGCCATA
GTTGAGCACA GGACCTCCGC CGACGGCAAA GGCAGAGACG GGAAGGCCCA CAAGGCGGGC
GACCGCGGCG TCTTTGCCCT GATGGCCGCC GAGATTATGC CAAGCGGAAG CGTGCTCATC
CTCTCCGGCG AGACGCCATA CGGCGGCTAC GAGCCCATGG TCGCCCCGGT CTACTACAGA
GTCCAGCTAG ACGGGCCTAG GTTCCTCAGA AACATCCTCC TGTGGGCCAC CGGCAACTAC
AGAGAGCTCA CCACGATGGT CTACCAAGCC CGCCAGATGG CACAGATCGC GTCCGACGCC
GCCGCGCTGA AGAACACCGC CGCCTCTTTG CAAAACGAGG TCTCCGCCGT TAAAACAGCC
GTCTCGCAGG TCTCCGCCAA GGTAGACGCC GTGGGCGGCC AGGTGGCTGA GCTCAGCCAG
AAGGTGGACC AGCTCACTCA GCAGCTCAAC GCCGCCGTGG CCGAGGCCAA CAACGCCAAG
ACCACCGCCT TCGTCGGCAC AGCCCTAGCC TTGATCTTCG CCATAGCCGC AGCCGCCCTC
GCCATACGCA GGAGATGA
 
Protein sequence
MNPKVAAFLL AAALAAAQQL YVPVLVDVSH GEATKGLDLW VNSTANPLAI TDFARLYVLV 
PPDAKLDPTL TKLNATKAAV IIRGDLSTVD LSQFKVIVLG QPPKPLSEAE LAALKKWFDS
GGRVLWCAAD SDYPAQGSEE SQAACNDVAE YLGAHIRADY VSVEDPQQNA GAAYRVVGVV
DPPPQLAFLG FLAQRVLLHG PGAIAVVLPD GRWVPATSPE AQKAYGNIYV IVRTTEKGAI
VEHRTSADGK GRDGKAHKAG DRGVFALMAA EIMPSGSVLI LSGETPYGGY EPMVAPVYYR
VQLDGPRFLR NILLWATGNY RELTTMVYQA RQMAQIASDA AALKNTAASL QNEVSAVKTA
VSQVSAKVDA VGGQVAELSQ KVDQLTQQLN AAVAEANNAK TTAFVGTALA LIFAIAAAAL
AIRRR
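
Both sequences above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how the gene sequence could be joined and translated to compare it against the protein sequence. It assumes the codon assignments of NCBI translation table 11, which match the standard code for internal codons, and it does not handle alternative start codons. The helper names (`join_blocks`, `translate`, `gc_fraction`) are hypothetical and not part of any tool referenced by this record.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the block spacing and line breaks from a displayed sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last displayed lines of the gene sequence, copied from above.
first_line = "ATGAACCCAA AGGTAGCCGC GTTCCTCCTA GCGGCGGCCC TCGCCGCCGC CCAACAGCTG"
last_line = "GCCATACGCA GGAGATGA"

print(translate(join_blocks(first_line)))  # MNPKVAAFLLAAALAAAQQL
print(translate(join_blocks(last_line)))   # AIRRR
```

The two printed fragments correspond to the first twenty and the last five residues of the protein sequence listed above. Passing the full joined gene sequence to `gc_fraction` would give a value to compare with the GC content field.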