Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1123 |
Symbol | |
ID | 6166078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 1007929 |
End bp | 1010325 |
Gene Length | 2397 bp |
Protein Length | 798 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641668275 |
Product | hypothetical protein |
Protein accession | YP_001794500 |
Protein GI | 171185581 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000327071 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000000331797 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGTTGTTG TGTTTCTTGT GTTGTTGGCG GCGTTTGGCT GGGCGGGTGG GGGGCTGGCG GTTGTGGGTG AGTATGTGGA TTTGGGGGGC GTCGTCAAGG CGGCTCCTGG GGGTTATGTG TTTGTGGTGA CTGACGTGTG GCGGGGGCTG GGGAACTGTA GCGGGGCGGC GGAGGCGGTG GGCTACCCCT ACGCCGTTTT GCCGTCGGGG TCGTTTAGGT GTTTTATTGT GCCGGCTGGG GAGGTGGGGG CTAGGCTCCG GGGGGTGCCC GTCGTCGACT TGAGGGGTGG GCCGGTGGCG GTGGTGGCGG AGGCGCCGGG GGTTCTCTAC CTGGTGGACC TCGCCTCGGG GAGGCCGGCG GTGGGGATTA GGCTCGTGAG GGGGGTCAAC ATGTCGGTGG GGGCGGCCGT GTCGGCTTAC GTGGAGTGGC CCCTGGCCCC GCGGTACTTC CCCCTGGGGG AGGGGGCGGT GGTGCTGCGG ACCTGGCGGG AGTACCCCTG CGACGTGGAT CTCCGCGGCG GCCCGGGCAA CGCCACGGTG GAGGTGGAGA ACCTCTACCG CCTGTCGGGG CTCGTCTCGG GGTCCCCCCT CTACGTGGAG GGGGCGGGGG GACGCGCCGC CGTCTACCTC CCAGATCTCG CCGCACAGAG GGTCAGGCTC CAGCTGTGCA ACGTGGGGCC CCACGCGGCG GTCAACATCT CGGCGTTGCG CAACGTTAAC TACAGCTTCC TCTGGCTCCT CACCAGGGGG GTGGTCCTTA CGTCAGACGG ATCCGAGCGA TACTCAGTAT ACCCAGGTGC CGACGTAAAG ATAGGGCTGG AATACGCCAA CCCGCTGGGG GCCGTGTTGA TTAACATACT GGGTGTGATA AGCGCCTACG TCGGAACTCT GCTCACAACA ACACGCCGGA AGTTCTACAT AACAGCCCTC TCTCAAGCCC TAGGCACAGA CCTTATAAAA ATAATAAGAT TACCATTAAA TTTTTTTAAG ATAATGATAT ATTATTTAAC GCCTATTTTC TTCTTGAACA TATTTGTAAT TTTTATGATT TTCTATCTTA TTTCAAATAT TTTCAATAAT ATGAAAATTG CTGATATTAT AAATAAAATT TTGATTTTTA GTTATAATTT TATTCTTATA TTTTATATAA GTTGGATTGG AGGAAGATTT GCAAAAAACT ATGTGATGTC ATTAATTGTG GCGTTGGGGA TGTATATGGT AGTGTTCATC ATCTCAGTTT TAGGCATTAC ACTTTTTTAT GTCATACCTC TTTCTATGAT ATTGCATACA TATGAATATA CAACTATTTT AATTTATGGA TTTTTTCTCG TAATATTTTT TACTATTTTA ATAAAGGCAT CAAGTGACAT CTCTAGGCTC GATAGGTGGC TTTTGAATGC CATAATGCTC TATATTATAA ACTCACTCAA TATTTTATTA TTGTTTATTA CATTTACACC TTTTTTATAC ACATGGCTTT TGATTAATTA CAATCACTAT ATTTATGTAA CAAATATATT GATAGCTGTT ACTTTTCTAG TCATCATACT ATTTTTATCT TTAAATACTT ACGACGGTTG GCGTATTTTG TTCCCTGATC TAAACATCTT CGAGTTTTTT GCAAGGCGGT GGATCGATGA GAATAGGGCT TCTGTTGTGC GGAGCTCGGT GGTTTTGTCC AATGGAGAGG TTGTGGAGGG CCGTCTTGAG GTCGTTGGAG AGGATTACGT CGTCGTGTCT GGGAGGACGG TTTTGCTTGA CCAGGTGGCA ATGTTTGGGA TTGAGCTGGA GGAGAAGAAG GTAGCGGAGG CCCTCTTGGG CGGTGGCGAG GCTTCTGTCT TTGCCTACAT GCGGAGGGCT GTGGAGAGGG GGGTGGACCC ACTGGGGCAC GCCGTTGCGG TTGTGGCTTC TGCCGTGTTG AAGGCAAGGG GGATCGGCGG GCAGGTGGCT CCTGTGTGGG AGGTGATGGG GGATGGGGGG TTGAGGGGGG CGGCCGCCGC CCTCTTGTGC GCCTTTAGCA GGTTGCGGGG GTTGGGCGTG TCGGACTTCG GGGGGTGTAG AGTCGCCTCA ACGGCCTCGG TGGGTGGGCT GGAGGGTCTC GTTGTCGAGG TGGCCCGCCT GTTGGCAGAC GGCAAGGTGG AGGAGGCGGC GCGGCGGCTC TCGGGTCTAT ACTTGGCGGA TCCGACGGCG GATACGCTAG CCTATCGGCT TGAGAAGGCG GTTAGGGGGT ACGGTAAGAA GCGTGATAGG CGGTATGCGG AGGATGTGGC TGTTTGGCTT GCGGCGTTTT GGATATATCT GAGGGTGGCC GGGTTGCATG ACGACGGGGT GAGATATTCC CGAAGCCTAA GAAGGAGGTT TATGAGGTGG CTGATCAATG ACTTATTCTA TCGGTAG
|
Protein sequence | MVVVFLVLLA AFGWAGGGLA VVGEYVDLGG VVKAAPGGYV FVVTDVWRGL GNCSGAAEAV GYPYAVLPSG SFRCFIVPAG EVGARLRGVP VVDLRGGPVA VVAEAPGVLY LVDLASGRPA VGIRLVRGVN MSVGAAVSAY VEWPLAPRYF PLGEGAVVLR TWREYPCDVD LRGGPGNATV EVENLYRLSG LVSGSPLYVE GAGGRAAVYL PDLAAQRVRL QLCNVGPHAA VNISALRNVN YSFLWLLTRG VVLTSDGSER YSVYPGADVK IGLEYANPLG AVLINILGVI SAYVGTLLTT TRRKFYITAL SQALGTDLIK IIRLPLNFFK IMIYYLTPIF FLNIFVIFMI FYLISNIFNN MKIADIINKI LIFSYNFILI FYISWIGGRF AKNYVMSLIV ALGMYMVVFI ISVLGITLFY VIPLSMILHT YEYTTILIYG FFLVIFFTIL IKASSDISRL DRWLLNAIML YIINSLNILL LFITFTPFLY TWLLINYNHY IYVTNILIAV TFLVIILFLS LNTYDGWRIL FPDLNIFEFF ARRWIDENRA SVVRSSVVLS NGEVVEGRLE VVGEDYVVVS GRTVLLDQVA MFGIELEEKK VAEALLGGGE ASVFAYMRRA VERGVDPLGH AVAVVASAVL KARGIGGQVA PVWEVMGDGG LRGAAAALLC AFSRLRGLGV SDFGGCRVAS TASVGGLEGL VVEVARLLAD GKVEEAARRL SGLYLADPTA DTLAYRLEKA VRGYGKKRDR RYAEDVAVWL AAFWIYLRVA GLHDDGVRYS RSLRRRFMRW LINDLFYR
|
| |