Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2478 |
Symbol | |
ID | 4245247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3812518 |
End bp | 3815760 |
Gene Length | 3243 bp |
Protein Length | 1080 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638107561 |
Product | protein splicing site |
Protein accession | YP_722160 |
Protein GI | 113476099 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR01443] intein C-terminal splicing region [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.242027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATAT TAGTAACAGG TGGCGCAGGT TTTCTGGGCT CTCATCTTAT TGATAGATTA ATAGAACAAG GCCATGAAGT CCTGTGCCTT GATAATTTTT ACACTGGCAA TAAACATAAT ATTTATAATT GGTTAAATAA TCCTAGCTTT GAGTTAATTC GCCACGATAT TACAGAACCC ATTAGGTTAG AAGTAGATCA AATTTATCAT CTAGCTTGTC CTGCTAGTCC AATACATTAT CAGTACAACC CCGTCAAGAC AATTAAAACA AATGTAATGG GGACATTAAA TATGTTAGGT TTAGCCAAAA GAGTAAAAGC AAAATTCTTT TTGGCTTCTA CATCTGAAGT ATATGGCGAC CCAGATGTAC ATCCTCAAAC AGAAGAGTAT AGAGGTAATG TTAATTGTAT TGGAATTCGC TCCTGTTTTG ATTCTAAAAC TGAAATTTTG ACTGAAGCAG GTTGGGTAGC TTTTCCAAAT CTACAATCAG AAGTTAAAGT AGCAACATTA AACTCAGAAG GTAAAGTTGA ATATCATATT CCTGAGGAGT ATATTGTACA ATCATATATT GGAGAAATGT ACCGTTTTGC TAATACTAAC TTTGATTTTT GTGTGACTCC TAATCACTGG ATGTATGTGC GTAATAAAAC TGGTAATTTA GAGTTTATTA GGGCAGATGA AGCTAAACTT TGGCAGAGTT TAGAGGTGTT AACTGGAGGA GATTTTGAGG GAGAAAAAGA AGAATGGTTG GAATTAAGAA AAAGCCCTAT AAATTCTCAT AGAAAAGTAG AAAAAATCTT TATGGATGAC TGGCTAGAAT TTTTAGGTTA TTACATTTCT GAAGGTAGAG TTGATGTAAA AAAAAGCTTG CGAGTAGTAG GGGGAAATGA TGCTTATGTA GCAGACTATA ATATATTAAT TGGTCAAGAA AACTCTGAAT TAGCTTTGAA AATAGCTAGT TGTTTACGCC GTTTAGGATT TAATTTTAGT GAAATTTTAT TTGATTCAGA TAAACATCAA TTTCGGGTTT GTAGTAAACA GCTAGCTGAG ATGTTATTAC CTTTAGGTAA GTCTGGTGAA AAATATATTC CTCGCGAATT ATTGAAACTA TCAAAACGAC AGTTATTAAT TCTATTTAAA GCACTGATAA TGGGTGATAA TTCAGAACAA AAAAATCATT ACACTTATTA TAGTAAATCT AAGCGATTAG CTGATGATAT ACAAGAGTTA GCACTTCGTT GTGGTTATGC GGCTACAGTA GTTTCTCATG CAGTGGGGCG TGACCTTTAT CAAGTTAATA TTCGACCTGC TGAGGATGCA AATTTAGTTG TGCCAGAAAG GTTCCATTAT GTAGGAAAAG TTTATTGTGT TAATGTCACA AACCATGTGG TTTTTGTAAG ACGAAATGGG CGTGCTGCTT GGTGCGGTCA ATGTTACGAT GAGGGGAAGC GAGTAGCAGA AACTTTAGCT TTTGATTACC ATCGCCAAAA TAATGTTGAT ATTAGAGTGG CTAGAATTTT TAATAGTTTG ACTGGAGATC AAAAGGTACT TTATTATATT GCTAAAAAAC TGTATTATGA AACTTTTGCA GAATGCTATG ATAGAATTAA TGGGGATATA TCAAGTGTAT CAGTGCCTTG TTTTGATGAA AATTATCAAA CGGTAATTAA ACCAATTTCT GCAATTTGGA AACATCATGT CAAGAAGAAG GGTTTTAAGA TAAAAATTAC TTGGGGTAAA CAAATAAAAA TCACAGAAGA TCATAGTCTA TTTACGAGAA ATGAAAATAA TAAACCTCAA GCAGTTTTCG GAAATGAGAT AAAGGTAGGA GATGAAATAG GAATTCCTAG TTATATCAGT TTTTTGGAAC AGCCTTTAGA GCCATTCCAC ATTACAGATA AAATATTAAT TCAAGAAGAA ATATATGTAG AAAGTGAAGA TACAATCTCT TATATTGAAA AGTATGGAGA TAAGATGAGG GAGTATTTAT TGGCAAAAAG TTTAAGCCCT AGTCAATTTT ATTCGATATT GAAAACTTAT GAAGCAAAAA ATCAAATTCC TTGGCATCTA TGGAAATATT TAGAACTTCC TCTATCGGAA AAAGATAAAG TTTGCTATTT ATCTAAAAAG GCAATTAAAA ATTGGATAGA CAATGTAGAA GAATTATTAT GGTTCTTAGG CTTTTATGTA GCTCGGGGAA GTTTGATCAA AAATGAAGTT GTGTTGAAAG GGGAACCTAG TCAACTAGAA AAAGTTATAG AATTAATAGA AAGAATCTTT GAATATAAGT CGGAAATTAA TGATTCAGGA TATATAAGTA TTAAGTCTAA AATTCTGGTT GATTTAATCG GATATGGATT AAATTTTGGG AATCAAGAAA AAGATATTCC TAATTGGATT TTACAACTAC CTGAACAACA ATTAATTAGA TTTTTGAAGG GATTCGTTGC TGGGAATAAC CTAGAAAATC AACTAAATTT CTATCTCGAA TTTAAAACTG ATAGTCAGTT AGTTGCTGAA AAATTAGTAT TGATTTTATC TAAATTTGGG TTGGTAGCAG ATGTGTCAGA AATAGAGGTA AATGAGGAAG ATATTGCCAA AATTTATCGA ATAATAATAG AGGGATTAGA AGATAAAAAT ATTCATAATT TGTCTAAGGT TGAGCAGAAA ATATCAGCTC TAACTACAGG TGATATTGCT TGGGGAAAAA TAGAAAGTAT TGAGGAGTTT GAAATAGATG ATTATGTGTA TGATTTCTCC GTGCCAAATT ATGAGAACTT TATAGGGGGA AGCTACAATG TTTTTGCTCA CAATACTTAT GGACCTAGAA TGTTAGAAAA TGATGGTCGA GTTGTTAGCA ATTTTATTGT ACAAGCCTTA AAAGGGATAC CTCTAACAGT CTATGGTGAT GGCTCCCAAA CCAGAAGTTT TTGTTATGTT TCAGATTTGA TAGAAGGGTT TATTCGGTTA ATGAATCAGG ATTTTATCGG GCCAGTTAAC TTGGGAAACC CAAGAGAATA TACAATATTA GAATTAGCTC AGAAAATTCA AACAATGGTT AACCCAGGTA CAGAGATAAT ATATAAACCT CTACCACAAG ATGATCCAAA ACAACGACAA CCAGATATTA CTCGTGGGAA AAAATATTTG GGTTGGGAGC CAACTGTTTT TCTTGAAGAA GGGTTAAAAT TGACTATAGA AGATTTTCGA GAGCGACTCA AAAATGAATT GCCAAAAAAC TAA
|
Protein sequence | MRILVTGGAG FLGSHLIDRL IEQGHEVLCL DNFYTGNKHN IYNWLNNPSF ELIRHDITEP IRLEVDQIYH LACPASPIHY QYNPVKTIKT NVMGTLNMLG LAKRVKAKFF LASTSEVYGD PDVHPQTEEY RGNVNCIGIR SCFDSKTEIL TEAGWVAFPN LQSEVKVATL NSEGKVEYHI PEEYIVQSYI GEMYRFANTN FDFCVTPNHW MYVRNKTGNL EFIRADEAKL WQSLEVLTGG DFEGEKEEWL ELRKSPINSH RKVEKIFMDD WLEFLGYYIS EGRVDVKKSL RVVGGNDAYV ADYNILIGQE NSELALKIAS CLRRLGFNFS EILFDSDKHQ FRVCSKQLAE MLLPLGKSGE KYIPRELLKL SKRQLLILFK ALIMGDNSEQ KNHYTYYSKS KRLADDIQEL ALRCGYAATV VSHAVGRDLY QVNIRPAEDA NLVVPERFHY VGKVYCVNVT NHVVFVRRNG RAAWCGQCYD EGKRVAETLA FDYHRQNNVD IRVARIFNSL TGDQKVLYYI AKKLYYETFA ECYDRINGDI SSVSVPCFDE NYQTVIKPIS AIWKHHVKKK GFKIKITWGK QIKITEDHSL FTRNENNKPQ AVFGNEIKVG DEIGIPSYIS FLEQPLEPFH ITDKILIQEE IYVESEDTIS YIEKYGDKMR EYLLAKSLSP SQFYSILKTY EAKNQIPWHL WKYLELPLSE KDKVCYLSKK AIKNWIDNVE ELLWFLGFYV ARGSLIKNEV VLKGEPSQLE KVIELIERIF EYKSEINDSG YISIKSKILV DLIGYGLNFG NQEKDIPNWI LQLPEQQLIR FLKGFVAGNN LENQLNFYLE FKTDSQLVAE KLVLILSKFG LVADVSEIEV NEEDIAKIYR IIIEGLEDKN IHNLSKVEQK ISALTTGDIA WGKIESIEEF EIDDYVYDFS VPNYENFIGG SYNVFAHNTY GPRMLENDGR VVSNFIVQAL KGIPLTVYGD GSQTRSFCYV SDLIEGFIRL MNQDFIGPVN LGNPREYTIL ELAQKIQTMV NPGTEIIYKP LPQDDPKQRQ PDITRGKKYL GWEPTVFLEE GLKLTIEDFR ERLKNELPKN
|
| |