Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1492 |
Symbol | |
ID | 8534650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 1613337 |
End bp | 1616213 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 646383883 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003263371 |
Protein GI | 261856088 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCAAG AAGAGCTTAA TCGCCTCCAT CACGAAAACG CACGTCAGCA GGCCGAGATC ATCCATCTCA CAGCACTGCT CAATGAGCAG CGTGCCAAGA TGGTACGCAT ATATGGTTCC CGAAGCTGGC GTTACACCCA ATTTTTACGA TCAATCGATG GCCTGATACA TAAATACTTA TTAGGCAGCA CGCGTATTGA TCTGGTACCC ATACACCAAC TGGAACTAAA CAACACCCCG AAAGGAAAAG TTTGGGTTTC AACCGGAGAA GACCCCCAGT TTTTAATCAG AGCACACAAC CCGGCCGTGC TTAAACAAAC CGGCTGGGCG CAACTTGAAT TCCAGCTTCG CTGCGATGAA CCGCTGCCGA TGCAACTGTT TATCGATTAT GGGCAAGGCT ACAAAGACGA TATCGTCATC AACCATCACG CCGATACCTC GGGCTTGATT CAAATTCCGC TCTACATTCC AGTGGAACTA CTAGACATCC GTTTTGACCC AGCGGCGCAT CCCACCGAGT TCAAAATATC CGGCGTACGG CTCAAACGCC TCAAACACCC ACCCGCAAAT CAAGATTCAG CAAATCAAGG CGCATACAGC CACCTTGCGT CTATTGCCGA CATTGGTTTT CATCTCGAAG CCTCCAATCA AATCACCCGC TCCCTTACCG ACAGCCGCCA TTGGCTCTCC CACGGTGAAG ACCCCTACTT CACGCTCCAA TACAAAAACA CCCAAGCATT AAAACCCGGC TGGCATTGCG CCAAACTCAA TATCACCACC GAGGCGCAAA AGGGAATAGC CAAATTCTAT TTCGACACCG GTAGTGGCTA TAACGAAGCC CAAACATTGG CCATACCCTT TGAGAGTGGC ATGCTCATTG AGCGTGTTAT TTATTGCAAA GAAACCATCA AAACCATCCG CTTTGATCCC ATTGACTTTA AAGGTGGCTT CAAACTAGAA GCGCTCGATT TCAAATCTAT TGATACAACA CAAGCCGAAG AAACCATGCG TCACCGCATT GCCGAACAAC ATCCAGATTT TGTAGGTACA TCACCTGCCG AATTTGAACA ACGACTCGAC GAAGTCATCC GGGCAATGCA ACCGGATGCC ACCACCCCAA AAACAGACAC CCTGATCACG CTCTACAACC AAACATTCGA CACCCGCCCG GGTGCCATAG CTTATGATCA ATGGATTGAG CAGGTAGAAC AGGCACAACT CCCCACACAA GCAACCGTAC AGGCATTTTT ATCCAGCCAA TCGACACCTG TAATCATTTC CATTGTCATG CCGGTATACA ACACGCCCGA AAAATACCTG CGTTTATGTA TCGACTCCGT TCGCGCACAA TCGTATCCGC ATTGGGAACT GTGCATCGCC GATGACAAAT CGCCCCAACC CCATGTAAAA AAAGTACTCG ACGAGTACAT AAAAAAAGAC AAACGCATCA AGGTTGTCTA CCGACCACAA AACGGCCATA TTTCAAAAGC ATCCAACAGC GCACTCAAAT TGGCAACGGG CGAATATGTC GCCCTGCTCG ATCACGATGA TGCCCTGCCG GAACACGCCC TGTATTTCAT GGCACAAGCC ATTGCAGAGC ATCCAGAAGC ACAAATCCTG TACAGCGATG AAGACAAAAT CGACATCCAC GGCCAGCGCA GCGAGCCACA CTTTAAAAGC GATTGGAACC CCGACCTGTT CTATTCTCAG AATTATGTTT CCCACCTCGG CGTATACAAA CGCGAATTAC TCCAGCGCAT CAATGGATTC CGCACCGGCG TAGAAGGCAG CCAAGATCAA GACCTACTCC TGCGTTGCCT GCCGCATGTC AAAGCCGAAC AAATTATCCA CATCCCGAAA ATCCTCTACC ACTGGCGCAC CCTAGAAGGC TCCACCGCCA TGGCCTCGGG CGAAAAATCC TACACCACCG ATGCGGGCAT CAAAGCACTC AGCGATTTTT TTGAAAAAAA TGGTCCCGCA GGAATTAAAA TAGAACAAGG ACTGGTGCCA AACACCTACC GCGTACACTG GCCTATTCCC AACCCCGCGC CACTCGTGAG CCTGCTCATC CCCACGCGTG ATCGTAAAAC CATTACCGAA ATCGCCGTGC GCAGCATTCT GGATAAAACC ACCTACCCCA ATTACGAAAT CATTATTCTG GATAACGGCA GTGAGGAACC AGAAACCCTA GACTGGTTTG CCGCCATCCA GCAGGAAGAC GAGCGCGTAA AGGTTTTGCG TTACGACCAC CCATTCAACT ATTCCGCCAT CAACAACTTT GGCGCACAAC ACGCCAAAGG CAGCCTGATT GGCCTGATTA ACAACGATGT CGAAGTCATC AGTCCCGATT GGCTCACCGA AATGGTCAGC CACGCGCTGC GGGAAGATAT TGGCTGCGTG GGCGCCAAAC TCTATTACAG CAACGACACA CTGCAACACG CAGGCGTAAT TCTAGGGATT GGCGGGGTTG CAAACCACTC GCACAAAAAT TCCAAACGCG ACTCACCCGG CTATTTTGCC CGATTGATCG TTGCGCAAAA TTTCTCAGCA GTAACCGCTG CATGTCTGAT TATTCGCAAA TCGGTTTACG ACCAAGTCGG CGGGCTCGAC GAAGTCAACC TCAAGGTCGC CTTTAACGAT GTTGATTTCT GCCTCAAAGT ACGGGAAGCG GGGTACCGAA ACCTCTGGAC ACCCTATGCC GAACTGTATC ACCATGAATC CATCAGCCGA GGCACCGAAG ACAGTCCAGA GAAACAAGCT CGCTTCCGAG GCGAGGTAGA ATTCATGCAA TCCAAATGGG GTGATGCCCT GAAATTGGAT CCCTTTTACA GCCCGAATCT GACTCAGGAT CGGGAAGATT TTTCAATAGG CAATTAA
|
Protein sequence | MSQEELNRLH HENARQQAEI IHLTALLNEQ RAKMVRIYGS RSWRYTQFLR SIDGLIHKYL LGSTRIDLVP IHQLELNNTP KGKVWVSTGE DPQFLIRAHN PAVLKQTGWA QLEFQLRCDE PLPMQLFIDY GQGYKDDIVI NHHADTSGLI QIPLYIPVEL LDIRFDPAAH PTEFKISGVR LKRLKHPPAN QDSANQGAYS HLASIADIGF HLEASNQITR SLTDSRHWLS HGEDPYFTLQ YKNTQALKPG WHCAKLNITT EAQKGIAKFY FDTGSGYNEA QTLAIPFESG MLIERVIYCK ETIKTIRFDP IDFKGGFKLE ALDFKSIDTT QAEETMRHRI AEQHPDFVGT SPAEFEQRLD EVIRAMQPDA TTPKTDTLIT LYNQTFDTRP GAIAYDQWIE QVEQAQLPTQ ATVQAFLSSQ STPVIISIVM PVYNTPEKYL RLCIDSVRAQ SYPHWELCIA DDKSPQPHVK KVLDEYIKKD KRIKVVYRPQ NGHISKASNS ALKLATGEYV ALLDHDDALP EHALYFMAQA IAEHPEAQIL YSDEDKIDIH GQRSEPHFKS DWNPDLFYSQ NYVSHLGVYK RELLQRINGF RTGVEGSQDQ DLLLRCLPHV KAEQIIHIPK ILYHWRTLEG STAMASGEKS YTTDAGIKAL SDFFEKNGPA GIKIEQGLVP NTYRVHWPIP NPAPLVSLLI PTRDRKTITE IAVRSILDKT TYPNYEIIIL DNGSEEPETL DWFAAIQQED ERVKVLRYDH PFNYSAINNF GAQHAKGSLI GLINNDVEVI SPDWLTEMVS HALREDIGCV GAKLYYSNDT LQHAGVILGI GGVANHSHKN SKRDSPGYFA RLIVAQNFSA VTAACLIIRK SVYDQVGGLD EVNLKVAFND VDFCLKVREA GYRNLWTPYA ELYHHESISR GTEDSPEKQA RFRGEVEFMQ SKWGDALKLD PFYSPNLTQD REDFSIGN
|
| |