Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0333 |
Symbol | engA |
ID | 4243148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 509153 |
End bp | 510514 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638105665 |
Product | GTP-binding protein EngA |
Protein accession | YP_720280 |
Protein GI | 113474219 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR03594] ribosome-associated GTPase EngA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTTAC CTATTGTAGC TATTATTGGC AGGCCAAATG TGGGGAAGTC CACAATAGTC AATCGCTTGG CTGAAAGTAA AGACGCCATC GTCCACGATG AACCAGGAAT AACTCGCGAT CGCACCTATC GAAATGCATA CTGGGAAGAC CGGGAGTTTC AAGTTGTAGA TACAGGAGGT TTAGTATTTG ATGACAATAC AGAATTTTTA CCCCTAATCC GCGAACAAGC TATGGCAGCT CTAGTAGAAG CAAGTGTAGC AATTTTTGTG GTAGATGGAC AAACAGGTCT TACTGGAGGA GATGAAGAAA TTGCCCAATG GCTCCGTCAA CAAACTATTC CGATCCTCCT AGCAGTTAAT AAATGCGAAT CTATAACAGA AGGATTGACC CAAGCTGCTA TGTTTTGGGA ATTAGGTTTA GGAGAACCTT ATCCTATTTC TGGTATTCAT GGCAATGGTA CTGGAGAATT ATTAGACGAT TTAATTACCT ACTTACCAAC CCAAGGAGAA ATTACTGAAA CTAATCAAAC TAAAATAGCA ATTGTTGGTC GTCCAAACGT TGGTAAATCA AGTTTATTAA ATTCATTTAT TGGAGAAAAA CGTGCAATTG TTAGTCCTAT TTCTGGTACA ACAAGAGATG CTATTGATAC TGTAGTAGAA CGAAATGGCA AAACTTATCG TTTGATTGAT ACTGCTGGAA TTAGAAAGAA AAAAAATGTC GAATATGGTG CCGAATTTTT TGGGATTAAC CGAGCTTTTA AAGCTATTCG TCGTGCTGAA GTTGTTATGT TTGTAATTGA TGCTTTAGAT GGAGTAACAG AACAAGACCA AAAACTAGCA AATCGGATCA TAGAAGACGG TAGAGCTTGT GTTATTGTAG TGAATAAATG GGATGCCATA GAAAAAGATA ATTATACTAT TTATACTTAT GAACAGGAAG TAAGGTCGCG ACTATATTTT GTGGAATGGG CAGAGATGAT TTTTGTTAGT GCACTAACTG GAAAACGAGT GGAGAAAATT ATAAATTTGA TAGATAATGC AGCCAATGAA TATCAGCGTC GAGTGACAAC TTCTGTAATC AATGAGGTAT TAGAAGAAGC AATTAGTTGG AATTCTCCCC CCACTAATCG TCAAGGTCGT CAAGGTAAAA TTTATTATGG AACTCAAGTA ACAAGTAAAC CACCAACAAT TGCATTATTT GTCAACGATC CTAAACGTTT TCCTGAAAAT TATCGGCGTT ATATTCAAAG TCAATTTCGT CAACATTTAG GATTTACAGG TACACCAATA AGATTACTTT GGCGTGGTAA AAAAGCTAGA GAAGTAGAAC AAAATACTGT TAATAGAGCA ACTCGTGTCT AA
|
Protein sequence | MPLPIVAIIG RPNVGKSTIV NRLAESKDAI VHDEPGITRD RTYRNAYWED REFQVVDTGG LVFDDNTEFL PLIREQAMAA LVEASVAIFV VDGQTGLTGG DEEIAQWLRQ QTIPILLAVN KCESITEGLT QAAMFWELGL GEPYPISGIH GNGTGELLDD LITYLPTQGE ITETNQTKIA IVGRPNVGKS SLLNSFIGEK RAIVSPISGT TRDAIDTVVE RNGKTYRLID TAGIRKKKNV EYGAEFFGIN RAFKAIRRAE VVMFVIDALD GVTEQDQKLA NRIIEDGRAC VIVVNKWDAI EKDNYTIYTY EQEVRSRLYF VEWAEMIFVS ALTGKRVEKI INLIDNAANE YQRRVTTSVI NEVLEEAISW NSPPTNRQGR QGKIYYGTQV TSKPPTIALF VNDPKRFPEN YRRYIQSQFR QHLGFTGTPI RLLWRGKKAR EVEQNTVNRA TRV
|
| |