Gene Tneu_1513 details

Gene Information

Locus tag: Tneu_1513
Symbol:
ID: 6165825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1345274
End bp: 1346401
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 63%
IMG OID: 641668670
Product: major facilitator transporter
Protein accession: YP_001794883
Protein GI: 171185964
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
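
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1345274
end_bp = 1346401
gene_length_bp = 1128
protein_length_aa = 375


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in the record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, the 1128 bp span corresponds to 376 codons, that is, 375 amino acids plus one stop codon.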


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.519208
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATATA CGTCGTCGGC TACGAACACG TCAAGAGCTG ATTTAAGAGC CTACGTCCTC 
CTCTCTACGC CCTACTCCTT CGTAGTCTTC CTACTGCCGT TCTATGTGCT TGAGATAGGC
GGCGGCTCGG TAGAGGTCGG CGTGGCCTAC GCGTCATATG CGGCGGCTGT GGTGGTGACG
CGTCCTACGT CGGGCGCGTT GGCGGATAAG TTCGGCAGAC GCAGGGTTAT GCTACTGGGT
GGCGCAACGC TGGCGGTTTC GATGGCCCTT CTGGCACTTT CCACCGGGGT GGCCCACGTG
TACATATCCC TCTTCTTGGC TGGAGCGGCG TCCAGCTTAG TCAACGTGGC GGCTCTGGCT
TATGTGTCAG ACGTCGGCGG GCTGGAGGAC CCCGCGCTCT ACTCGAGGCT GAAGACCGCG
GCGGCCTTAG GCGCGTTGGC GGGCGGGGCG TCCATCCCGG CTGTGTATGT CCTCTCTAGG
CTTCTCAGCT TCGCAGACGC CTTTAGGCTT GTGGCGGCTG TTCTAGCACT TCTGGCCGTC
TCAGCTCTTT TGGCCGTCCC GGGCGAGACG AAGCACCTTG CCGCCAGACA CAAGAAGGGC
GACCGGGTCC AGACCTTCTG CGTGATGTCG CTGGCTACGG CGTTCGGCTC CGCGGTGGGC
CTCTACGGCC CTCAGGTGAT GCTCTACCTC CACAGGAGGT ACTCGCTGTC TCCCTACACC
GCCGTCGTGG CGTATCTACC CTCGGTGGTG TCGTGGATAG TGGGGCCTAG GCTTGCGGGG
CCCGCCTATG CGAGGTTGAT CGCGGGAGGC GCCGCGATGG CTCTAGCGCT CGTGGGCATG
GCGGTCTCTC CATCGCCGTA TGTCTTCTCG GCGTTTTGGG CCATCGAGAG CCTTGGGGTC
GCCGCCGTCT CAACCTCCCT AGACCAGAGG CTGGTTAGAC ACGTCGCCGG GTCCTACTGG
GGTAGGGGCT ACGGCCTCTA CCAGGCGTTG TACAATCTGG GTTACTCCGC CGCGGCAGCC
GTCTCGGGCT TCTTCGACGA CCCCTTCACC CCCGCGCTGG CCCCCCTCTC CGCGGCTTTG
CTCACGGCGG CTGTGTGTAG TAACCTACAA AAACGCCGAG CGGCATGA
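
The gene sequence is displayed in space-separated blocks of ten bases, so the grouping has to be removed before the sequence can be measured. A minimal sketch follows; the helper names `ungroup` and `gc_fraction` are hypothetical, and only the first displayed line is used as an excerpt.

```python
def ungroup(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, used here only as a short excerpt.
first_line = "ATGGGATATA CGTCGTCGGC TACGAACACG TCAAGAGCTG ATTTAAGAGC CTACGTCCTC"
excerpt = ungroup(first_line)

print(len(excerpt))          # 60 bases: six blocks of ten
print(gc_fraction(excerpt))  # GC fraction of the excerpt only, not of the whole gene
```

Applying the same two functions to the full sequence gives values that can be compared with the Gene Length and GC content fields of the record.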
 
Protein sequence
MGYTSSATNT SRADLRAYVL LSTPYSFVVF LLPFYVLEIG GGSVEVGVAY ASYAAAVVVT 
RPTSGALADK FGRRRVMLLG GATLAVSMAL LALSTGVAHV YISLFLAGAA SSLVNVAALA
YVSDVGGLED PALYSRLKTA AALGALAGGA SIPAVYVLSR LLSFADAFRL VAAVLALLAV
SALLAVPGET KHLAARHKKG DRVQTFCVMS LATAFGSAVG LYGPQVMLYL HRRYSLSPYT
AVVAYLPSVV SWIVGPRLAG PAYARLIAGG AAMALALVGM AVSPSPYVFS AFWAIESLGV
AAVSTSLDQR LVRHVAGSYW GRGYGLYQAL YNLGYSAAAA VSGFFDDPFT PALAPLSAAL
LTAAVCSNLQ KRRAA
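
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on a library. It assumes that the amino-acid assignments of translation table 11 (the table named in this record) are the same as those of the standard genetic code, so that only the choice of start codons differs. The function name `translate` is illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the ten-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGGATATA CGTCGTCGGC TACGAACACG"))  # MGYTSSATNT

# Last displayed line of the gene sequence; it is in frame because the
# preceding 18 lines hold 1080 bases, a multiple of 3.
print(translate("CTCACGGCGG CTGTGTGTAG TAACCTACAA AAACGCCGAG CGGCATGA"))  # LTAAVCSNLQKRRAA
```

Both outputs match the corresponding ends of the protein sequence listed in the record.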