Gene Tneu_1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1007 
Symbol 
ID6164914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp899278 
End bp900735 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content66% 
IMG OID641668160 
Productextracellular solute-binding protein 
Protein accessionYP_001794385 
Protein GI171185466 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.256846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000168852 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGTGGCGG TGGCCATAAT AGCCGCCCTA TTCGCGGCGC AGCAACAACA GGCGCCTGCG 
GCGTCTCCCC CTCCTCCCAC GGCCTCGACG CCTGCGACCG GCACGGTGAC CTCCACGCCG
ACGACCACGG CGGCGTCTCC CACGCCTACG GCGACTTCTC CGGAGGTCTG CACCACGTTG
GTGGTCATAA CTAGGCACCC CACCGACATC CTAGACGCGG CGAGGGCCCT CTTTCTACAG
AGCGACGTGG CGAAGAGATA CGGCGTTAGA GACGTGGTGT TTAAGCCGGT CCCCGCCGCG
CAGTGGAGGG CCTTGATCCA GGCAGGCCAG GCGGACGTGG CGTGGGGAGG GGGGCCGACG
CTCTTCGACT CGCTGTATAA AGACGGCCTG CTCCTGCCGC TGGAGGGGGA CGAGGTGAAG
AGCGCGCTGG CCCAGATACC CAAGACCATC GCGGGGATGC CCATGATGCG CGTTGGGCCA
GACGGCAAGG TCTACTGGGT GGCCTGGGCG GTGTCCAGCT TCGGCATCAC CATAAACACG
CAGGTTCTCA AGGACCTCGG CCTCCCCGAG CCGAGGAGCT GGGCAGACCT CGCCTCGTTT
GAGTACGGGA AGGCCGTGCT GAGGGGTACC CCGGCCGCCG GCCTTTCGTC GCTGACTAAG
TCCACCAGCA ACACGAGGAT CGCCGAGATC ATCCTCCAGG CCTACGGCTG GGACGAGGGG
TGGAGGGTGA TCACTCTCAC GGCAGCCAAC GGCAGGATAT ACGGCGGTAG CGAGGCCGTA
AGAGACGCCG TGATCGCCGG CGAGATAGGC GCCGGGTGGA CCATCGACTT CTTCGGCTAC
ACGGCCCAGC TCCAGAACCC GGCGACGAGG TACGTCGTGC CCAACGACAC CTCCGTCAAC
GGGGACCCCA TCGCCGTGGT GAAGGGCACA AGGTGCCGCC AGGCGGCCGA GGCCTTCGTG
GCCTGGGTCA TCACCCAGGG TCAGGTGGTG GTGTTTGACC CCAAGGTGAA CAGGATGCCG
GTCAACCCCA GCGCCTTCGA CACGCCCGAG GGCAAGAAGA GGGCCGACCT AAAGGCCGTC
TACGACTACA TGCTCCACCT CAGGACTATA AACTTCAACG ACACCCTGGC CCTCGCCACC
GAGAACGTGG TGATGTACTA CTTCGACGCG GCGGTGACGG ACAACGCCCA GCTCCTCCAG
GACGCCTGGG CCAAGCTGGT CAAGGCCTAC CTCGGCGGGA GGATAGACAG AGCCAAGGCC
GTGGAGCTGG CCCGGCGCCT GGGCGAGCCC GTCACCTTCA AGGACCCCGA CACCGGCCAG
ATGGTCAAGC TGACGCTGGA GTACGCCATG AAGATCAACG ACAAGCTCAA GGACCCGGCC
TACCGCGACA AGATCTACGC CGCGTGGAGG GAGGCCGCCA GGGAGAAGTA CAGGGAGGTG
GCATCCCAAA TCCCCTAG
 
Protein sequence
MVAVAIIAAL FAAQQQQAPA ASPPPPTAST PATGTVTSTP TTTAASPTPT ATSPEVCTTL 
VVITRHPTDI LDAARALFLQ SDVAKRYGVR DVVFKPVPAA QWRALIQAGQ ADVAWGGGPT
LFDSLYKDGL LLPLEGDEVK SALAQIPKTI AGMPMMRVGP DGKVYWVAWA VSSFGITINT
QVLKDLGLPE PRSWADLASF EYGKAVLRGT PAAGLSSLTK STSNTRIAEI ILQAYGWDEG
WRVITLTAAN GRIYGGSEAV RDAVIAGEIG AGWTIDFFGY TAQLQNPATR YVVPNDTSVN
GDPIAVVKGT RCRQAAEAFV AWVITQGQVV VFDPKVNRMP VNPSAFDTPE GKKRADLKAV
YDYMLHLRTI NFNDTLALAT ENVVMYYFDA AVTDNAQLLQ DAWAKLVKAY LGGRIDRAKA
VELARRLGEP VTFKDPDTGQ MVKLTLEYAM KINDKLKDPA YRDKIYAAWR EAAREKYREV
ASQIP