Gene Tneu_1076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1076 
Symbol 
ID6166133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp961409 
End bp962557 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content68% 
IMG OID641668228 
Productmajor facilitator transporter 
Protein accessionYP_001794453 
Protein GI171185534 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.197988 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0589295 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAAAT GGGGAGGGGT AGCGCGGCCC GTGGTGCCCG AGCTAAGGCT CATCGCGGTG 
GCAGGCGTGG GCTGGCTCTT CGACGCCATG GACGTCCTCC TCCTCTCGTA CATACTGGTG
GCGTCTGCGG CGGAGCTGGG GCTGGGGGTC TGGGAGAAGT CCGCCGTCGT GCTGGCCAAC
AACCTCGGCA TGTTGATAGG GGCAACCGCC TTCGGCAGAC TGTCGGATAG GCTGGGGAGG
AGGGCCGTCT TCACCGCCAC GCTCCTCCTC TACAGCCTCG CCACGTCCGC CACAGCCCTC
GTAAAAAACG GGTGGGAGCT GGCGGCGGTC CGCCTCATCG CAGGGCTGGG CCTAGGCGGC
GAGCTCCCCG TCGTCGCCTC GTACGTGTCT GAGCTCTCGC CGCCGGAGAG GAGGGGGAGA
AACGTGGTGG TTCTAGAGAG CTTCTGGTCC CTAGGCGCGT TGGCGGCAGC CGCCGTGGCC
TACTTCCTCT TCCCCCGCCT CGGCTGGAGA ACCTCCCTGC TCCTCCTCGG CCTCACCGCG
TTATACGCCG CGGTGATAAG GGCCGCCCTC CCGGAGCACA AGCCAGCCCC CCGGGGGGCC
GCCCCGGTTG AGGCTAGGCG GCTCTACCCC GTGTGGTACA TATGGCTGGC CCTGGCGTTT
GGATACTACG GAGTCTTCCT CTGGCTACCC ACCATCCTCG TCAGAGAGAG GGGGCTTGCC
GAGGTGCAGA CCTACCAGTT CATGCTAATT ACGACGGTTG CCCAGATCCC CGGCTACCTC
ACCGCCGCCT ACCTAGTGGA GAGGATAGGG AGGAGGCCCA CCGCCGCTGT CTTCTTCCTC
GGCTCCGCCG CCTCGGCGGC CGCCCTCATA TACAGCGCAG ACCTGCCCCA GCTCTACGCG
TCGGCCATCG CGCTGAACTT CTTCAACCTA GGCGCCTGGG GGGTGGTATA CGCCTACACG
CCCGAGCTCT TCCCAGAACA CGTCAGAGGC CTCGCCGTCG GGACCGCCGG CTCCGCCGCG
CGGGTGGGGA TGATCCTCGG CCCCTGGCTC TACCCCGCGG CCGGCCTCTA CGCCCTAGCG
GCGGTGCCCC TCCTCTGGCT CGCCGTCCCC GCCGCCGTGT ACGCCCTGCC GGAGACCAAG
AGGCGCTAG
 
Protein sequence
MLKWGGVARP VVPELRLIAV AGVGWLFDAM DVLLLSYILV ASAAELGLGV WEKSAVVLAN 
NLGMLIGATA FGRLSDRLGR RAVFTATLLL YSLATSATAL VKNGWELAAV RLIAGLGLGG
ELPVVASYVS ELSPPERRGR NVVVLESFWS LGALAAAAVA YFLFPRLGWR TSLLLLGLTA
LYAAVIRAAL PEHKPAPRGA APVEARRLYP VWYIWLALAF GYYGVFLWLP TILVRERGLA
EVQTYQFMLI TTVAQIPGYL TAAYLVERIG RRPTAAVFFL GSAASAAALI YSADLPQLYA
SAIALNFFNL GAWGVVYAYT PELFPEHVRG LAVGTAGSAA RVGMILGPWL YPAAGLYALA
AVPLLWLAVP AAVYALPETK RR