Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0227 |
Symbol | |
ID | 6164412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 202136 |
End bp | 203263 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641667392 |
Product | hypothetical protein |
Protein accession | YP_001793628 |
Protein GI | 171184709 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2407] L-fucose isomerase and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCTATC TCCTGACCTC GGCCGTACAC GGCGCCGACT TCATCGCCGA GGTGGAGAGA TACGTCGCAA AGTACATCTC CCTCAAAGAC CCGGCGAGGC CTGAGCCGGA GAGGTTCCCC GTCATAATCC ACGGCACCGG CGGGACAACC GCCCAGGCGC TGGAGCTCGT GGAGAGGGCT GGCGCACGCG GCGCCGTTCT AGTGGGCTTC GGCGAACACA ACAGCTTCGC CAGCGCGCTA CACGCCAAGG CCGAGCTGGA GGCCGCCGGC CGCACGGCCG TCGTCTACCA CTGCCCCACC TACGCCGAGT GCGGCCCCGC TTTAGCCAAG GCGGCTAGGG TCTCTGCCGC CGCCTCCTCC CTCATCGGCG CAAAGGCCGT GTTGATCGGT TCCAAGACCA AGCAGGCCGA CTTGGTTAGC GAGAGGTTCG GATGGTCGGT GGAGGTCGTG CCTCTTGCCG ACTTCGAATC GACCGTCGCG AACTCGGAGC CCGACAGCGA AGCCCTCTCT TTGTTCGGAG ATGAGAGGGT GGCGAAGGTC GCCTCTGCCC TCCGCAGGCT GGCAGCGGGG AGCCACCTCG TCGCCATACA GTGCTTCCCC TTCCTTATGA AGGCCGGATA CACCCCATGT CCAGCGCTCG CGTTGTTAAA CGCCAGGGGC CTCACGGCGG CGTGCGAGGG GGACCTCTCG GCTGGCTTCG CCATGTTGTT GCTGAGGAGG CTCGCCGGGG GGAGCAGCTG GATAGCCAAC GTTGTGGAGA GCCGCGGCGA GATGGCCACC TTCGCCCACT GCACGGTCTC GCTGGACCTA GTGGAGCGGT GGTGGTCCAT GCCCCACTTC GAGTCGGGCC TGCCCTATGG AATCGCCGGG GAGCTGAGGA GAGGCGTCTA CACGGCCGTG TCTATCTCGC CGAGGTTTGA AAAGGCGGCT GTGGGCCGGG TATACGTCGA GCGGAGCGGC AACTACCTAC AGAGCGCATG CCGCACCCAA GCCACGGTTA GGTTCGGGAG GCCGGTCCGC TTAGAGGAGG AGGCGCCCGC GAACCACCAC GTCTTTGCCC CTGGCGACGT GGCCGCCGAG GCCGCAGCCG TCTTAAAGCT CCTCTCCTTC TCAACCGTTA TATACTAA
|
Protein sequence | MAYLLTSAVH GADFIAEVER YVAKYISLKD PARPEPERFP VIIHGTGGTT AQALELVERA GARGAVLVGF GEHNSFASAL HAKAELEAAG RTAVVYHCPT YAECGPALAK AARVSAAASS LIGAKAVLIG SKTKQADLVS ERFGWSVEVV PLADFESTVA NSEPDSEALS LFGDERVAKV ASALRRLAAG SHLVAIQCFP FLMKAGYTPC PALALLNARG LTAACEGDLS AGFAMLLLRR LAGGSSWIAN VVESRGEMAT FAHCTVSLDL VERWWSMPHF ESGLPYGIAG ELRRGVYTAV SISPRFEKAA VGRVYVERSG NYLQSACRTQ ATVRFGRPVR LEEEAPANHH VFAPGDVAAE AAAVLKLLSF STVIY
|
| |