Gene Tneu_1644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1644 
Symbol 
ID6165476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1452067 
End bp1453278 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content58% 
IMG OID641668807 
Productbasic membrane lipoprotein 
Protein accessionYP_001795012 
Protein GI171186093 
COG category[R] General function prediction only 
COG ID[COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.848912 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGA TCTGGATCGC TCTTGGGGTA GTTGTAGCGA TCGCGGTTAT TGGGGCGCTT 
CTCGCGACGC AGTCGGCGCA GAAACAACAG CCTACTACAC AACCCCCGCC CCAGAAGACT
ATATACGTTA TATACGACAT CGGGGGGCGC GGGGACCTCT CATTCAACGA CATGGCGTAT
CTAGGGGCCT CTAAAGCCGC TAGAGACTTC GGGCTCGGGC TTAAGGAGGT GCAGAGCAAA
ACCCAAGACG ACTACGTGCC CAATTTGAGA GCAGCCGCGA GATCAGGAGA CGCCGCTTTG
GTGGTCGCAG TCGGGTTCCT TATGACAGAC GCGCTGAAGC AGGTTTCGCA GGAATACCCC
GCCGTCCACT TCGCGATTAT AGACGGCTAT GTGCCCAACA GGTCAAACGT GGTTTCCGTC
CTCTACCGCG AAAACGAGGG CTCGGCGCTG GTCGGCGCTC TGGCGGCGCT CACAGCCTAC
TACTACAACT GCACCAAGGT TGGGATAGTG CTGGGGATGG AGATCCCCGT CCTCTGGAAG
TTCGAGATCG GGTATGCCTA CGGCGTGAGG TGGGCTGAGA GGTATATAAA GCAGAGGTTT
GGCAAAGACG TAAAATTCGA CGTCTTGTAC GTATACACAG GGTCCTTCAA CGATCCAGCT
AAGGGCAAGC AAGCTGCTGA GGTCATGCTG TCGCAAGGCG TATGCGTGAT ATACCAAGCG
GCCGGCGCCA CGGGCCTAGG CGTGTTTGAG GCTGTGGCTG AGGCGGGGAA GAGGGCGGGC
AGAAACATGG GCCCGCCCTT CGCCATAGGC GTAGACGCCG ACCAGGACTA CATCAAGCCG
GGGTTCATAC TGGCCTCTAT GATGAAGCGC GTCGACGTCG GCGTCTACAG AGCGGCTAAG
ATGGCCGTCG AGGGAACCTT CAAAGGCGGC GTCTTGGAGC TTGGCCTCAA GGAGGGCGGC
GTGTCTGTGA GCACCTTGGA CGACCTAGGC CAGTTCCTCG AGATAGGGAT AAGAGCCGGC
GCTGTGAAGC AGGAGGACGC CCAGAGGATC ATCGATACCG TGAAGGAGAT GAGGTCTAAG
ATCCCCACCT GGGTGTGGGA GGCCGTGGAC AAGCTTAGGC AGGACATAGT GGCTGGCGTG
GAGAAGGTGC CTCTGCCTAC GACACAGGAC CAGGTTGTGA AGCTGAGGAG AGAGCTGGGC
TTAGCCGGCT GA
 
Protein sequence
MNKIWIALGV VVAIAVIGAL LATQSAQKQQ PTTQPPPQKT IYVIYDIGGR GDLSFNDMAY 
LGASKAARDF GLGLKEVQSK TQDDYVPNLR AAARSGDAAL VVAVGFLMTD ALKQVSQEYP
AVHFAIIDGY VPNRSNVVSV LYRENEGSAL VGALAALTAY YYNCTKVGIV LGMEIPVLWK
FEIGYAYGVR WAERYIKQRF GKDVKFDVLY VYTGSFNDPA KGKQAAEVML SQGVCVIYQA
AGATGLGVFE AVAEAGKRAG RNMGPPFAIG VDADQDYIKP GFILASMMKR VDVGVYRAAK
MAVEGTFKGG VLELGLKEGG VSVSTLDDLG QFLEIGIRAG AVKQEDAQRI IDTVKEMRSK
IPTWVWEAVD KLRQDIVAGV EKVPLPTTQD QVVKLRRELG LAG