Gene Information |
Locus tag | Tery_4101 |
Symbol | |
ID | 4245615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6325134 |
End bp | 6326900 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638109002 |
Product | poly-gamma-glutamate biosynthesis protein |
Protein accession | YP_723582 |
Protein GI | 113477521 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
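The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, but both are consistent with the listed values. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(6325134, 6326900)
print(length)                              # 1767
print(expected_protein_length_aa(length))  # 588
```

Both results agree with the Gene Length (1767 bp) and Protein Length (588 aa) fields.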
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.422401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.368838 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTACA TACCTAATTT GACTCAAAAA TCTCTTTTTG AATTAGCAAG TTCCGGTGAT TTTCAGGCAA TTAGTCAATG GATTAATAAA AAACTTAAAC CTCAAGGAAT TTCAGCTCGT ATAGCTAAAG AAAATACTGG ATATCTAGAA GTTTTAGTAG AGTTTCAGAC TCAACCTCCT GTAGATAGAT TAATTAAGTT TATCTGTTAT CAACTTTCTC AACTTAACTA TCCTACACTA GAAAAAGTAA AAATTGTGGG GCGTTTAAGT GGTTCACCTA ATATACTATG GAAACACTCT GTCAGAATTA ATTCTCATGC AAAAAAAAAT TTCAATAAGC AATCTAATTT TGTAAACAAA AGTGATAATT TGCAATTTCA AACATTTCGT TATTTGATCT TACTCAGTTC AGCGGTTGCA GCTTTCATTA TAGGTATTTT AGTAAGTTAT TATAGTGTTT TGGTACGAAA TTCAACCTCA GGTCAATGGG TAGAAACTGC TATAGAAAAG GTGAGAGTTG TGGAGCATAG AAAGGTACAA AACTCCCAGG ATCCTATGGT GACTTTAATG TTTGGTGGAG ATGTTAATTT ATCTAACCAA GTTTCTAATT TAGTAAAGAG AGATTATAAG TTACCTTTTG CTAAAATGAA TGAGTATAGG GCTGCAGACT TATCAATAGT TAACCTGGAA AGTCCTTTGA CCCGTTCTAC TCTCAACAGT AGAACTCAGC AACAAAAATC AACGGTAAAT CCTAGTTATG TTAAGGCATT AACCTCAGGA GGAGTTGATC TGGTAAATTT AGCTAATGAC CATACTTTGG GTTATGAGCA AAAAAGTTTG TTAGAGACAA TAGAAACTTT AGAGAATGCG GGTATTCATT CTTTAGGAGC GGGCAAAACA GAAGAAGAGG CTAGAAGGCC AAAAATTTTT GAAGTTAAAG GCCAAAAGAT TGCATATCTC AATTACTATG ATACAGATAT TCAACCAACT ACTGAATCAG TATATGTAAA TAGTCGGAAT AAGGATAGGC TCTCTTCAGA TATTCAAATT TTGAAGAAGC AGGTAGACTG GATAATTGTT AATTATCATT GGGGGGTTCA ACTCTCAGAA TATCCTGGAG ATTGGCAGAT GAATATAGCG AGGATGACAA TTGACCAAGG TGCTGATTTG GTAGTAGGAC ATCATCCTAA AGTATTGCAG GGGGCAGAAA TTTATCGGGG ACGACCTATT ATATATTCTT TGGGAAATTT TATTTTTGGA GACACTTCTA ACAAAGAGAG TGATTATGAC ACAGCAGTTT TGAAGGTATC TTTAAAACCA GGAAAAATGA AGATTGAGTT TTTGCCTGTA GTGGTTAGTA AGTACCAACC CCACATTGTC AAAGGTGAAA AAGGTAAAGA AATTCTTAAA CACATTGCTC AAATTTCTAG TATTTTTCAC CAGCCAATGA GAACTCCTAT AATAATAAAT ACGATAAATG ATGATTTTAA TTTTGTTGGT ATTGACTCTT TTCCTAGGGA AGAAAATTCT AAAACTTTCT CAACTCCAAT TTTACCTGAG TTACCTCTAA AATCTCCACA AGCTGATCCA AATCCTACAA GCTCTTCTCA TAATAATTCA GAGCAAGAAG CAAGTAATAA TAATAATAGC TTTTCTTTAC CACCAATATT AAGTCCTGCA CCCACTCCTA AAGAAAGAAT AGATCCTTTC ATTAAAAAGC CATTTATCAA AGAACCTTTT ATTGAATTGC CTCGTTTACA AATTTAA
|
Protein sequence | MNYIPNLTQK SLFELASSGD FQAISQWINK KLKPQGISAR IAKENTGYLE VLVEFQTQPP VDRLIKFICY QLSQLNYPTL EKVKIVGRLS GSPNILWKHS VRINSHAKKN FNKQSNFVNK SDNLQFQTFR YLILLSSAVA AFIIGILVSY YSVLVRNSTS GQWVETAIEK VRVVEHRKVQ NSQDPMVTLM FGGDVNLSNQ VSNLVKRDYK LPFAKMNEYR AADLSIVNLE SPLTRSTLNS RTQQQKSTVN PSYVKALTSG GVDLVNLAND HTLGYEQKSL LETIETLENA GIHSLGAGKT EEEARRPKIF EVKGQKIAYL NYYDTDIQPT TESVYVNSRN KDRLSSDIQI LKKQVDWIIV NYHWGVQLSE YPGDWQMNIA RMTIDQGADL VVGHHPKVLQ GAEIYRGRPI IYSLGNFIFG DTSNKESDYD TAVLKVSLKP GKMKIEFLPV VVSKYQPHIV KGEKGKEILK HIAQISSIFH QPMRTPIIIN TINDDFNFVG IDSFPREENS KTFSTPILPE LPLKSPQADP NPTSSSHNNS EQEASNNNNS FSLPPILSPA PTPKERIDPF IKKPFIKEPF IELPRLQI
|
| |
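The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that grouping, compute GC fraction, and translate a coding-strand sequence. The helper names are hypothetical. The codon table is built from the standard genetic code, whose codon-to-amino-acid assignments are the same as those of translation table 11 (the tables differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same codon-to-amino-acid mapping.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace grouping and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first
    stop codon. Unrecognized codons are reported as 'X'."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
fragment = clean_sequence("ATGAATTACA TACCTAATTT GACTCAAAAA")
print(translate(fragment))  # MNYIPNLTQK
```

The translated fragment matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the same check against the GC content and Protein Length fields.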