Gene Information |
Locus tag | Tery_0156 |
Symbol | |
ID | 4241749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 230297 |
End bp | 232696 |
Gene Length | 2400 bp |
Protein Length | 799 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638105504 |
Product | transglutaminase-like |
Protein accession | YP_720123 |
Protein GI | 113474062 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
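The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Tery_0156 record above.
length = gene_length(230297, 232696)
print(length)                           # 2400
print(expected_protein_length(length))  # 799
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields of the record.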
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.973257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0989488 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAATT ATTTTTCTGG ATGGAAAGTG CGAAATCTAC CTATATTAGG ACTTCTGTGG AAACAAGTAC AAGCAGCACC TAAACCTATT GCTGAAGATT CTATTCTATT ACGAGTAATG GTACAGGCAT TGGTAATAGT AGGTATTATA GCAATTGACG TGGCTGCTGG GGTAGCAACA GATGAACTAC CTTTAACTGT TTGGGCAGGA CCACTAAGTA TTATAGGAGC AATTTGGAGT TGGTATAATC GCTACAAAAG AAATGTGCTA GCTAAATTCT GCTTGGCGAT CGGAATGTTA TTAATTTTTG CTGCTTTTTT GAGTAGATTA TTTGGTGAGC TAAATGATAC ACGACTGGTA TTGGCTGAAT TATTAATTCA ACTTCAAGTC TTACATAGCT TCGATCTACC TAGGCGAAAA GACTTAGGTT ATTCAATGGT TATAGGGCTA ATTTTGTTGG GGGTTGCAGG CACTATTAGC CAAACTTTAA CTTATGGACC TGTATTAATA GTGTTTACTA TATTTGCTAT TCCTTGTCTT GTTTTAGATT ACCGTTCTAG GTTAGGTTTG TACTCAAGAT TGTCCAGTAA TAAATTAGAG CTTCGAAGGC AAAAACAGCA AAAAAAAATA TCTCTGTTAA ATGCTACTTT GTCTCCCAAG CGATTAACAA TATTATTGTT AGTAATTCTT GGTTTGGGCA TGACAATTTT TGCTTTTTTA CCAAGGTTAC CTGGATATCA AATACGAACT TTTCCTGTGA GCAGTACGAT TAATTTTGAT TTAGAAAATT TCGATGGGCG AACAATTACA AATCCTGGTT ATATTAGTCA AGGTAGGGGA AATAGAGATG GTTCTGGAAA TAGTAATGGA ACTAACCAAG AATCAGGTCC TGGAAGTTTA AATGATACTT ATTATTACGG GTTTAATACT AAAATTAACC AAAATTTGCG TGGGCAATTA GTGCCTCAAG TTGTGATGCG AGTTAGGTCG CAAGCTGAAG GTTTTTGGAG AGTTTTAGCA TTTGATAAAT ATACTGGTGA AGGTTGGGAA ATTTCTCGAA ATGATAAGAC TATTACTTTA GATCGTAGCA GTTTTTCTTA TCAATTTTCT GTGGGTTTAC AGAGAAAGAA TTATAAAAAT GAAACTAAAG AAGTCATTCA AAATTATACA GTAGTTTCTC CGCTACCAAA TTTAATTCCA GCTTTATATC AGGCCAAAGA AGTATATTTT CCTACTAAAA AAGTAGCCAT TGACCTTGAT GGAAATTTAC GTTCTCCTCT TGAACTTAGA GAAGGTTTAA CCTATACAGT AATTTCTAAT GTTCCTTATC GAAATCGTAG TATTTTGAGT CAAGCGGGAA CAAATTATCC AGAAAGTATT AGTGAATATT ATTTACAGAT TCCTTCTGAA ATACTAGAAA AAGTCAGACA AAAAACAGAA GATATTTTAG CAAATAAAAA TAAAGTTGCT CAAGAAATTA AACCTATTAT TTCTCCTTAC GAAAAAGCTC TTTATTTAGC TCAATACCTT AAGCAAAACC CAAATTATAA ACTTCAAAAA GCACCACCAT TTTTAGCAGA AGATGAAGAT TTAGTCTCAG CTTTTTTATT TGGTTACCAA AATAGCCGGG AAGGGGAAAA AATTACGGGT GGTTATGGCG ATCATTTTTC TACTGTTTTG ACAATAATGT TACGTTCTAT TGGAATTCCT GCGAGATTAG TTGCCGGTTA TGCTCCTGGT GAATTTAATC CTTTTACTGG TTTATATGTG GTCAAAAATA CAGATGCCTA TATGATGACG 
GAGGTATATT TTCCGGGGTA TGGGTGGTTT GCTTTTAATC CTATTCCGGG AATGCCATTA ATTCCTCCTT CCATTGAAAA AAATCAAAGT TTTACTGGGT TAATGGTTTT TTGGAAGTGG GTAGCTGGTT GGTTACCTTC TCCTGTAACT GGTTTTTTGG AGAGGTTTAT TGGTGGAATA TTCCTTTGGA TATTGGGAGT TATTGCTTGG TTTTTTGCTT TATTTTCTCA AGGTTGGCAA GGCATATTTA CTGGTTTAAT ATTATTTACT GCTTTAGGTT TTTTGGGTTG GTTATTATAT ATTTTTTGGC AAAAATGGCA TTATGGTCGT TGGTTAGGAA GGTTATATCC TATGGAGAGT ATCTATCAAC AAATGTTGAA TTTGATGGCG AGTAAGGGAT TAGTTAAACA TGGTTTTCAG ACCCCTTTTG AATATACTAA GGCTATTAAG GGATATAACT CTAGGAATGA AGCAGAAATA ATAGAAGAAA TTTCGCAAGC TTATGTAGAA TGGCGTTATG GTGGTAAGGA GGTTAATTTG AATAGATTAA AAGGTTTACT GCGGAATTTA AAGAGTAGTA TCAGGCGGAA ATTAAATTAA
|
Protein sequence | MSNYFSGWKV RNLPILGLLW KQVQAAPKPI AEDSILLRVM VQALVIVGII AIDVAAGVAT DELPLTVWAG PLSIIGAIWS WYNRYKRNVL AKFCLAIGML LIFAAFLSRL FGELNDTRLV LAELLIQLQV LHSFDLPRRK DLGYSMVIGL ILLGVAGTIS QTLTYGPVLI VFTIFAIPCL VLDYRSRLGL YSRLSSNKLE LRRQKQQKKI SLLNATLSPK RLTILLLVIL GLGMTIFAFL PRLPGYQIRT FPVSSTINFD LENFDGRTIT NPGYISQGRG NRDGSGNSNG TNQESGPGSL NDTYYYGFNT KINQNLRGQL VPQVVMRVRS QAEGFWRVLA FDKYTGEGWE ISRNDKTITL DRSSFSYQFS VGLQRKNYKN ETKEVIQNYT VVSPLPNLIP ALYQAKEVYF PTKKVAIDLD GNLRSPLELR EGLTYTVISN VPYRNRSILS QAGTNYPESI SEYYLQIPSE ILEKVRQKTE DILANKNKVA QEIKPIISPY EKALYLAQYL KQNPNYKLQK APPFLAEDED LVSAFLFGYQ NSREGEKITG GYGDHFSTVL TIMLRSIGIP ARLVAGYAPG EFNPFTGLYV VKNTDAYMMT EVYFPGYGWF AFNPIPGMPL IPPSIEKNQS FTGLMVFWKW VAGWLPSPVT GFLERFIGGI FLWILGVIAW FFALFSQGWQ GIFTGLILFT ALGFLGWLLY IFWQKWHYGR WLGRLYPMES IYQQMLNLMA SKGLVKHGFQ TPFEYTKAIK GYNSRNEAEI IEEISQAYVE WRYGGKEVNL NRLKGLLRNL KSSIRRKLN
|
| |
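The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of standard-library Python. This is an illustrative sketch, not the database's own method. It assumes the displayed gene sequence is already the coding-strand sequence (it begins with ATG even though the gene lies on the minus strand), and it uses the amino acid assignments of translation table 11, which match the standard genetic code; the alternative start codons of table 11 are not handled. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG codon order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
print(translate("ATGTCTAATT ATTTTTCTGG ATGGAAAGTG"))  # MSNYFSGWKV
print(gc_percent("ATGTCTAATT"))                       # 20.0
```

The translated fragment matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.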