Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3798 |
Symbol | |
ID | 4243746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5836695 |
End bp | 5839820 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638108732 |
Product | small GTP-binding protein |
Protein accession | YP_723316 |
Protein GI | 113477255 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTATT CTGAAACAAG CGAGCAGTAC TATATATCAG AGGATGTAAA AAAAAGAATA CAAGAAGCTA AATATCAAAA GCTTAAGTGG TTATATTTAA GCGGGTGTAA ATTAACTGAA GTTCCTGGTG ATGTTTGGGA ATTAGAGCAG TTAGAAGTAT TAGATTTAGG CAGCAATGAA TTAACAAGTC TGCCGGAATC AATTGGCAAA CTCTCCAATT TAACTTCGCT TTATTTAGTC AATAATAAAT TAACAAGTCT GCCGGAATCA ATCACCAAAC TCTCCAATTT AACTGAGCTT TATTTAGATG GCAATCAATT AACAAGTCTG CCGGAATCAA TCACCAAACT CTCCAATTTA ACTGAGCTTT ATTTAAGTGT TAATAAATTA ACAAGTCTGC CGGAATCAAT TGGCAAACTC TCCAATTTAA CTTCGCTTGA TTTAGGTGGT AATCAATTAA CAAGTCTGCC GGAATCAATC ACCAAACTCT CCAATTTAAC TGAGCTTTAT TTAGGTCACA ATCAATTAAC AAGTCTGCCG GAATCAATCA CCAAATTATC CAATTTAACT GAGCTTTATT TAGGTCACAA TCAATTAACA AGTCTGCCGG AATCAATCAC CAAATTATCC AATTTAACTT CGCTTGATTT AAGCTGGAAT AAATTAACAA GTCTGCCGGA ATCAATCACC AAACTATCCA ATTTAACTTC GCTTTATTTA GGTAGTAATC AATTAACAAG TCTGCCGGAA TCAATCACCA CACTCTCCAA TTTAACTGTG CTTGATTTAG GTAGTAATCA ATTAACAAGT ATGCCGGAAT CAATCACCAA ACTCTCCAAT TTAACTGAGC TTTATTTAGA TGGCAATCAA TTAACAAGAC TGCCGGAATC AATCACCAAA CTCTCCAATT TAACTAAGCT TGATTTAAGG AATAATCAAT TAACAAGACT GCCGGAATCA ATCACCAAAC TCTCCAATTT AACTAAGCTT AATTTAAGCT GGAATAAATT AACAAGTCTG CCGGAATCAA TTGGCAAACT CTCCAATTTA ACTTCGCTTT ATTTAAGGGA TAATCAATTA ACAATTCTGC CGGAATCAAT CACCACACTC TCCAATTTAG GATGGCTTTA TTTAAACAAT AACCCCCTCG AAAACCCACC AATAGAAATT GCTACAAAAG GAATACAAGA AATTAGAGAC TATTTCCAGC AAGAACGAGA AAAAGGAATA GATTATATCT ATGAAGCAAA GTTACTAATT GTTGGAGAAG GTGGAGCCGG CAAAACAACT TTAGCAAATA AAATTCTCGA CCAAAACTAT CAACTCAAAG ATGAAGATAC AACCAAAGGA ATTGAAGTTC ATCAGTACAA ATTTCAGACA AAAAACCAAA ATGACTTTCA AATAAATATT TGGGATTTTG GCGGACAAGA AATTTATCAC GCTACCCACC AATTTTTCCT GACCAAACGC TCCCTATATA CCCTTGTCGC TGATACCCGC AAAGAAGATA CAGACTTTTA TTATTGGCTC AATGTAGTAG AATTATTAAG CGGTAATAGC CCCTTATTAA TAGTTAAAAA TGAGAAACAA GAACGGAAAC GAGAAATTAA TCAACGGGCA TTACAAGGGC AATTTACTAA TATTAAAGAA GTCTTAGCTA CCAACCTTAA AACTAATCGT GGCTTAGAAG AAATTATCAG AGAAATCGAA CATCATATTA GTAAGTTACC CCATGTTGGC AGTCGTTTAC CTAAAACTTG GAAACAAGTC CGGGAAATAT TAGAATTAGA TTCCCGTAAC TATATTAGTC TAGAAGAATA TTTATCTATT TGTGAACAAA ATGGCTTTGA AAAAAGGGAA TATAAATTAC AATTAAGTGG CTATTTACAC GACTTAGGAA TTTGTCTTCA TTTTCAGGAT GACCCACTCT TAAATAAAAC AGTTATTCTC AAACCAGAAT GGGGAACAGC GGCAGTTTAT AAAGCCTTAG ATAATCAGAC AGTTCGTAAT AATTTCGGCG AATTCACCAA AGATGATTTA GCCAATATCT GGAATGAAGA AAAATATGTG AATATGCGGG ATGAACTGCT GCAATTAATG ATTAGATTTA AACTTTGTTA TAAAATTTAT GGCAATTCTC AGACCTATAT TGCTCCTCAA CTCTTAACAG AAAATCAACC AGAATATGAC TGGGATGAAA GTAATAATTT GATTTTACGT TACACCTACG AATTTCTGCC CAAAGGAATT ATTACTCAGT TTATTGTAGC TATGCATAAA GATATTGAAG AACAGAAATA TGTCTGGAAA AGCGGAGTTA TTCTCAAGAA AAATGAAACA AGAGCAGAAG TAATTGAATA TTACAACATA AGAGAAATAA AAATCAGAGT TTCTGGGCGG GAAAAACAGT ATTTAATGAC TATCGTAACT TACGAATTCG ATAAAATTCA CAGTTCCTAT AATAATCGAC TAAAATATAA TCAGTTAATT CCTTGTAATT GTAATGTTTG TCAAAATAAT CAGAACCCAA CATCATACAA ATTTGAAATT TTAAAGAACC GAATAAATAA TGGTAAGGAA ACAATTGAAT GTGATTATCC TCCCTTCTAT GAAGTTAATA TAAAAAGCTT AATTGATGAT GTAATAAATA TAAGTGAGTA TGATAAACTC TCACAGTCTA ACTCTCATCC AAATAATGTA TATAATAAAT ATTATAGTTA TACTAATTAT GGCGGAGACT GGAATATAGA CGCCAAAGAC ACTAATAGTC AAATTATGGG AAATAAAACC ATGAATAAAA ACATTGACAA AAGCCGAAAA ATCGAAAATA AAGACGGAAA TATTACAGGT AATGTACTGG GTGACGAAAG TAATATCGAA GGAAATATAA CCACACATTC TCCGGCGCCA AAAACCGAAA AACCCGATAA ATTTGGATGG CTAGATTTTT TGACTTGGGC GCAGAATGGG CAAATGATTT TCTATTTAGC TTCTGCCACT ATCGGCATTT TTGTACTTAT TATTTATCCG AAACTATTTC CATCTGGTTT CCCTAAATTA ATAGAGAAAG TTCAAGAGAT ATTTCCAGCT CCGCAAGAAG AAATTGAGTC CGAACAAAAT CAATAA
|
Protein sequence | MTYSETSEQY YISEDVKKRI QEAKYQKLKW LYLSGCKLTE VPGDVWELEQ LEVLDLGSNE LTSLPESIGK LSNLTSLYLV NNKLTSLPES ITKLSNLTEL YLDGNQLTSL PESITKLSNL TELYLSVNKL TSLPESIGKL SNLTSLDLGG NQLTSLPESI TKLSNLTELY LGHNQLTSLP ESITKLSNLT ELYLGHNQLT SLPESITKLS NLTSLDLSWN KLTSLPESIT KLSNLTSLYL GSNQLTSLPE SITTLSNLTV LDLGSNQLTS MPESITKLSN LTELYLDGNQ LTRLPESITK LSNLTKLDLR NNQLTRLPES ITKLSNLTKL NLSWNKLTSL PESIGKLSNL TSLYLRDNQL TILPESITTL SNLGWLYLNN NPLENPPIEI ATKGIQEIRD YFQQEREKGI DYIYEAKLLI VGEGGAGKTT LANKILDQNY QLKDEDTTKG IEVHQYKFQT KNQNDFQINI WDFGGQEIYH ATHQFFLTKR SLYTLVADTR KEDTDFYYWL NVVELLSGNS PLLIVKNEKQ ERKREINQRA LQGQFTNIKE VLATNLKTNR GLEEIIREIE HHISKLPHVG SRLPKTWKQV REILELDSRN YISLEEYLSI CEQNGFEKRE YKLQLSGYLH DLGICLHFQD DPLLNKTVIL KPEWGTAAVY KALDNQTVRN NFGEFTKDDL ANIWNEEKYV NMRDELLQLM IRFKLCYKIY GNSQTYIAPQ LLTENQPEYD WDESNNLILR YTYEFLPKGI ITQFIVAMHK DIEEQKYVWK SGVILKKNET RAEVIEYYNI REIKIRVSGR EKQYLMTIVT YEFDKIHSSY NNRLKYNQLI PCNCNVCQNN QNPTSYKFEI LKNRINNGKE TIECDYPPFY EVNIKSLIDD VINISEYDKL SQSNSHPNNV YNKYYSYTNY GGDWNIDAKD TNSQIMGNKT MNKNIDKSRK IENKDGNITG NVLGDESNIE GNITTHSPAP KTEKPDKFGW LDFLTWAQNG QMIFYLASAT IGIFVLIIYP KLFPSGFPKL IEKVQEIFPA PQEEIESEQN Q
|
| |