Gene Tneu_0139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0139 
Symbol 
ID6164664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp127460 
End bp128584 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content58% 
IMG OID641667305 
Productextracellular solute-binding protein 
Protein accessionYP_001793542 
Protein GI171184623 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0201839 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAACTA GGAGGAAGCT CCTAGCCGGC GCGGTAGCCG CGGCGGTTGT AGCCGCCTTG 
GGAGGTAGCG TTCTGCTTTC GAGGGGGCCG GAGAAGAGGG CGGCAACACT CGGCGGACAG
CTAGCCGTCT ACAACTACTC CTACTACATA GACAAGGACC TGCTCACGGA GTTCGAGAGG
GAGACCGGCG TTAAGGTGAT CTACCAGGAG TTCGAGAGCG GGGAGGAGGC CTACGCCGCC
CTACTCAGAG GCGGGGGCGG GTACGACCTA GTAGTGGTGC CGGATATGTA CCTAAAGGAG
GTGATCAAGG GGGGCTACGT GAGGAAGATG GACCACGGCA GACTAGCCAA CATCAACAAC
ATAGACCAGG CCTTCTTCGA CAACCCGAAC GACCCAGGCC TCCAGCACTC TATTCCATAC
GCCTTCGGCA CCACGGGCTT CGCCGTCAAC TACCACGCGA TGGCCGTAGA GGCCGGCAGA
AAACTCGAGA GCTGGGGCGA CCTCTTCGAC TTCGGTCTCC TGGAGAAGAT GAGAAACAAG
GTGGCTATGT TGGAGGAGTT CGTGGAGCCC GTCATGGCGG CTAAATACGC CCTGGGAATA
GACCCAAACG ACTGGAGCCA AGCGGCTGTG GACAAGGTCG CAGAGCTTCT GAAGAAGCAG
AAGGGCTACA TAAGAGGCTA CATGGGGGTC AGCCAGATCG TTCCGGCTAT AGCCGCCGGC
GAGCTGTGGG TTTCACAGAT CTGGAGCGGA GACGCAGCCA CGGCACGCGA CGAGTTTATC
AAACACGCCG GTGAGAAAAA CGCCGATAAG TTCGAGTACG TGTTGCCAAA GCCAATGACG
CACAGATGGG TCGACTTCAT GGTGATCCCC CGCGACGCGA AAAACATCGA CGCGGCATAC
GCCTTCATTG ACTTCCTGCT TAGGCCTGAG AACTCTGCTA GAATCACCAA GGCGTCTTAC
TACCCAACAG CGCTGAAGAG ACAGCTACTC GAAAAGCACC TCAGCCCCGA CATATTACAG
GACCCCACGG TCTTCCCGCC TGAGGGAGCC AAACTCATCT ATCTCAACTA CACAGACGAG
ATGATTAAGG CCGTGGAGAA GATCAGCTAC GCCGTCAAAG GCTAG
 
Protein sequence
MVTRRKLLAG AVAAAVVAAL GGSVLLSRGP EKRAATLGGQ LAVYNYSYYI DKDLLTEFER 
ETGVKVIYQE FESGEEAYAA LLRGGGGYDL VVVPDMYLKE VIKGGYVRKM DHGRLANINN
IDQAFFDNPN DPGLQHSIPY AFGTTGFAVN YHAMAVEAGR KLESWGDLFD FGLLEKMRNK
VAMLEEFVEP VMAAKYALGI DPNDWSQAAV DKVAELLKKQ KGYIRGYMGV SQIVPAIAAG
ELWVSQIWSG DAATARDEFI KHAGEKNADK FEYVLPKPMT HRWVDFMVIP RDAKNIDAAY
AFIDFLLRPE NSARITKASY YPTALKRQLL EKHLSPDILQ DPTVFPPEGA KLIYLNYTDE
MIKAVEKISY AVKG