Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0726 |
Symbol | |
ID | 8533863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 788244 |
End bp | 790250 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646383112 |
Product | transglutaminase domain protein |
Protein accession | YP_003262622 |
Protein GI | 261855339 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGCA AGATGCAGCC ACTGGATCGT CCCCTGGGTA TTGTGATTGC GCTCGCCATC GCCGATCATC TGCCCCATAT GCCCATATGG GCCATACCTC TAGTGTTGCT CGGGCTGGTC TGGCGTTTCC AACATGAATG GCGCGGCTGG CCCTTACCGA CTCAACCCCT GCTCATTTTG CTGGCCATCG TTTTCAGCTT GCTGGTGCTG TTTACCTACC AGAGCCTTTG GGGGCGGGAT CCCGGCGTCA CAATGCTCAC GCTCGGTGCC CTGCTCAAAC TCCTCGAAAC CCGGCAACTG CGGGATCAGT TCGCTCTGCT GCTGTTGGGG TTCTTCCTGA TCGTCAGTCT GCTGCTCTTT GACCAAGGCA TATTCACCGC CTTGATTTCC CTTGGTTTGT TCGCAGGCCT AGTCACAGCC TGGATTGGTA TAAGCAACCC GAATATCCGC CTGATTAAGC CACGTATCAA AGCGGCTTCG GGCTTGATGC TGGCCGGTTT GCCGATTGCC CTTCTGCTTT TTTTGCTTTT TCCTCGTCCG CCCGGTGCCT TATGGGGCAC CAAACAACCG ACGACGGCAC AGGCCCGAAC AGGGTTGTCT GATCAGCTCA CCGCAGGCGA ATTTGAGCGA CTATCCACCG ATCCAACACC GGCATTTCGT GTCCACGTTA ATGGCGCAAT GATTCCGCCT CCTGAGCGCT ATTGGCGGGT GTTGGTCATG AGCGATGAGA CAAACAACAC TTGGCATGCG GATATCCCCA ATATTTTCCG CCCCGTGCGT GCGCCAAATG TGCGCGTTGA CCCGACCAGT GCCGTTGAAT ACACCGTCAC ACTGGAACCG TCAGATCGCC GGTTTTTGCC CACGTTGGCC ATGGCGACGG TACTGCCGCG GCAATCCGTT CTGAGTGATA CCGGCAGTCT GTTTTCGCTT CGCCCACTGA ACGATCGTTA TCGTTACACC GTTACCTCGG CGATACGTTA CCACCTCGAT GAACGCCGAT TATCAGCCGA CACCCGCGAA CGCAACCTGG CCTTGCCCAC AGGTGACCCC AAACTCAAGG CACTGGCAGC CCAATGGAAG GGGCTTGCCC CGCTCGAAAT TCGTGACCGG GCGTTGAATT ACTTTCGCTC GCAAGGATTC AGCTACACCC TCACGCCGGG CAAACTGCCC GAGCAAAACC GAATGGACAC CTTTTTGTTC GATACCAAAC GCGGTTATTG CGAACATTAC GCCAGTGCCT TCACCTTGCT GATGCGTGCC GCCGGTGTTC CCGCCAGGAT CGTGACGGGT TATCAAGGGG GCGAAGTCAA CGGCGATTAT CTGCTGGTGC GTCAAGCCGA TGCCCATGCC TGGAGCGAAA TATGGGTCAA AGGCGAGGGG TGGATACGCG TCGATCCAAC CCAAACAGTT GCCCCGGAAC GCATTACTGA CGGCATCGCA CAAGCCGCAC AACAAGATGC CGCTTTGCCC GCCTCGCTGC GCCGTGATGA CAGCCTGGCC CGTCAATGGT CGTTACTGGG CGATCGGGTT GAAAACGACT GGAATCAGTA CGTTCTTGGG TACAGCGGCA GCACTCAAAC CGACCTGCTG CAATGGTTTG GTCTAGAAAA AATCAGCCAG TGGGGAAGGT TTGGATTTGC TTTTATTATC GTCGCAATCG CATGGATCCT GATCTTCGGC TTCTGGCGGC ATCTTAGCCG ACCGGAAAAA CACATGCCGC CCGTTGATCG CGCGTGGTTC GCCGTCGCGA CCGCGCTTTC CCGATTAGGC ATTACGCGTC AACCGGGAGA AACCCTGCAA GCCTACTGCC GGCGGGCGGA ACAGTCCTTG CCCAATCACG CGGCAGCAAT CAGACAGACC CAGCAGTTGA TTTCTCAATG GCGATACGCC CCCCGCTTCA GCAAACGCAG CCAAGCCCTA GCCGAAAAAA CGGCACAAAA CCTGAGCACA CAGCTTAAGT GGTTGCGATG GCGGGAGAAC GTGTTTCAGG GACGGAAAAG AAATTGA
|
Protein sequence | MNSKMQPLDR PLGIVIALAI ADHLPHMPIW AIPLVLLGLV WRFQHEWRGW PLPTQPLLIL LAIVFSLLVL FTYQSLWGRD PGVTMLTLGA LLKLLETRQL RDQFALLLLG FFLIVSLLLF DQGIFTALIS LGLFAGLVTA WIGISNPNIR LIKPRIKAAS GLMLAGLPIA LLLFLLFPRP PGALWGTKQP TTAQARTGLS DQLTAGEFER LSTDPTPAFR VHVNGAMIPP PERYWRVLVM SDETNNTWHA DIPNIFRPVR APNVRVDPTS AVEYTVTLEP SDRRFLPTLA MATVLPRQSV LSDTGSLFSL RPLNDRYRYT VTSAIRYHLD ERRLSADTRE RNLALPTGDP KLKALAAQWK GLAPLEIRDR ALNYFRSQGF SYTLTPGKLP EQNRMDTFLF DTKRGYCEHY ASAFTLLMRA AGVPARIVTG YQGGEVNGDY LLVRQADAHA WSEIWVKGEG WIRVDPTQTV APERITDGIA QAAQQDAALP ASLRRDDSLA RQWSLLGDRV ENDWNQYVLG YSGSTQTDLL QWFGLEKISQ WGRFGFAFII VAIAWILIFG FWRHLSRPEK HMPPVDRAWF AVATALSRLG ITRQPGETLQ AYCRRAEQSL PNHAAAIRQT QQLISQWRYA PRFSKRSQAL AEKTAQNLST QLKWLRWREN VFQGRKRN
|
| |