Gene Tery_3798 details


Gene Information

Locus tag: Tery_3798
Symbol:
ID: 4243746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5836695
End bp: 5839820
Gene Length: 3126 bp
Protein Length: 1041 aa
Translation table: 11
GC content: 33%
IMG OID: 638108732
Product: small GTP-binding protein
Protein accession: YP_723316
Protein GI: 113477255
COG category: [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
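
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of encoded residues, assuming a single trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5836695, 5839820)
print(length)                     # 3126
print(protein_length_aa(length))  # 1041
```

Both values agree with the Gene Length and Protein Length fields of the record.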


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTATT CTGAAACAAG CGAGCAGTAC TATATATCAG AGGATGTAAA AAAAAGAATA 
CAAGAAGCTA AATATCAAAA GCTTAAGTGG TTATATTTAA GCGGGTGTAA ATTAACTGAA
GTTCCTGGTG ATGTTTGGGA ATTAGAGCAG TTAGAAGTAT TAGATTTAGG CAGCAATGAA
TTAACAAGTC TGCCGGAATC AATTGGCAAA CTCTCCAATT TAACTTCGCT TTATTTAGTC
AATAATAAAT TAACAAGTCT GCCGGAATCA ATCACCAAAC TCTCCAATTT AACTGAGCTT
TATTTAGATG GCAATCAATT AACAAGTCTG CCGGAATCAA TCACCAAACT CTCCAATTTA
ACTGAGCTTT ATTTAAGTGT TAATAAATTA ACAAGTCTGC CGGAATCAAT TGGCAAACTC
TCCAATTTAA CTTCGCTTGA TTTAGGTGGT AATCAATTAA CAAGTCTGCC GGAATCAATC
ACCAAACTCT CCAATTTAAC TGAGCTTTAT TTAGGTCACA ATCAATTAAC AAGTCTGCCG
GAATCAATCA CCAAATTATC CAATTTAACT GAGCTTTATT TAGGTCACAA TCAATTAACA
AGTCTGCCGG AATCAATCAC CAAATTATCC AATTTAACTT CGCTTGATTT AAGCTGGAAT
AAATTAACAA GTCTGCCGGA ATCAATCACC AAACTATCCA ATTTAACTTC GCTTTATTTA
GGTAGTAATC AATTAACAAG TCTGCCGGAA TCAATCACCA CACTCTCCAA TTTAACTGTG
CTTGATTTAG GTAGTAATCA ATTAACAAGT ATGCCGGAAT CAATCACCAA ACTCTCCAAT
TTAACTGAGC TTTATTTAGA TGGCAATCAA TTAACAAGAC TGCCGGAATC AATCACCAAA
CTCTCCAATT TAACTAAGCT TGATTTAAGG AATAATCAAT TAACAAGACT GCCGGAATCA
ATCACCAAAC TCTCCAATTT AACTAAGCTT AATTTAAGCT GGAATAAATT AACAAGTCTG
CCGGAATCAA TTGGCAAACT CTCCAATTTA ACTTCGCTTT ATTTAAGGGA TAATCAATTA
ACAATTCTGC CGGAATCAAT CACCACACTC TCCAATTTAG GATGGCTTTA TTTAAACAAT
AACCCCCTCG AAAACCCACC AATAGAAATT GCTACAAAAG GAATACAAGA AATTAGAGAC
TATTTCCAGC AAGAACGAGA AAAAGGAATA GATTATATCT ATGAAGCAAA GTTACTAATT
GTTGGAGAAG GTGGAGCCGG CAAAACAACT TTAGCAAATA AAATTCTCGA CCAAAACTAT
CAACTCAAAG ATGAAGATAC AACCAAAGGA ATTGAAGTTC ATCAGTACAA ATTTCAGACA
AAAAACCAAA ATGACTTTCA AATAAATATT TGGGATTTTG GCGGACAAGA AATTTATCAC
GCTACCCACC AATTTTTCCT GACCAAACGC TCCCTATATA CCCTTGTCGC TGATACCCGC
AAAGAAGATA CAGACTTTTA TTATTGGCTC AATGTAGTAG AATTATTAAG CGGTAATAGC
CCCTTATTAA TAGTTAAAAA TGAGAAACAA GAACGGAAAC GAGAAATTAA TCAACGGGCA
TTACAAGGGC AATTTACTAA TATTAAAGAA GTCTTAGCTA CCAACCTTAA AACTAATCGT
GGCTTAGAAG AAATTATCAG AGAAATCGAA CATCATATTA GTAAGTTACC CCATGTTGGC
AGTCGTTTAC CTAAAACTTG GAAACAAGTC CGGGAAATAT TAGAATTAGA TTCCCGTAAC
TATATTAGTC TAGAAGAATA TTTATCTATT TGTGAACAAA ATGGCTTTGA AAAAAGGGAA
TATAAATTAC AATTAAGTGG CTATTTACAC GACTTAGGAA TTTGTCTTCA TTTTCAGGAT
GACCCACTCT TAAATAAAAC AGTTATTCTC AAACCAGAAT GGGGAACAGC GGCAGTTTAT
AAAGCCTTAG ATAATCAGAC AGTTCGTAAT AATTTCGGCG AATTCACCAA AGATGATTTA
GCCAATATCT GGAATGAAGA AAAATATGTG AATATGCGGG ATGAACTGCT GCAATTAATG
ATTAGATTTA AACTTTGTTA TAAAATTTAT GGCAATTCTC AGACCTATAT TGCTCCTCAA
CTCTTAACAG AAAATCAACC AGAATATGAC TGGGATGAAA GTAATAATTT GATTTTACGT
TACACCTACG AATTTCTGCC CAAAGGAATT ATTACTCAGT TTATTGTAGC TATGCATAAA
GATATTGAAG AACAGAAATA TGTCTGGAAA AGCGGAGTTA TTCTCAAGAA AAATGAAACA
AGAGCAGAAG TAATTGAATA TTACAACATA AGAGAAATAA AAATCAGAGT TTCTGGGCGG
GAAAAACAGT ATTTAATGAC TATCGTAACT TACGAATTCG ATAAAATTCA CAGTTCCTAT
AATAATCGAC TAAAATATAA TCAGTTAATT CCTTGTAATT GTAATGTTTG TCAAAATAAT
CAGAACCCAA CATCATACAA ATTTGAAATT TTAAAGAACC GAATAAATAA TGGTAAGGAA
ACAATTGAAT GTGATTATCC TCCCTTCTAT GAAGTTAATA TAAAAAGCTT AATTGATGAT
GTAATAAATA TAAGTGAGTA TGATAAACTC TCACAGTCTA ACTCTCATCC AAATAATGTA
TATAATAAAT ATTATAGTTA TACTAATTAT GGCGGAGACT GGAATATAGA CGCCAAAGAC
ACTAATAGTC AAATTATGGG AAATAAAACC ATGAATAAAA ACATTGACAA AAGCCGAAAA
ATCGAAAATA AAGACGGAAA TATTACAGGT AATGTACTGG GTGACGAAAG TAATATCGAA
GGAAATATAA CCACACATTC TCCGGCGCCA AAAACCGAAA AACCCGATAA ATTTGGATGG
CTAGATTTTT TGACTTGGGC GCAGAATGGG CAAATGATTT TCTATTTAGC TTCTGCCACT
ATCGGCATTT TTGTACTTAT TATTTATCCG AAACTATTTC CATCTGGTTT CCCTAAATTA
ATAGAGAAAG TTCAAGAGAT ATTTCCAGCT CCGCAAGAAG AAATTGAGTC CGAACAAAAT
CAATAA
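
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do that in Python, assuming the sequence is pasted as printed here, in space-separated blocks of ten bases. The helper name `gc_fraction` is hypothetical. Only the first 30 bases are used in the example, so the printed value is not the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First 30 bases of the gene sequence as printed in this record.
prefix = "ATGACTTATT CTGAAACAAG CGAGCAGTAC"
print(round(gc_fraction(prefix), 2))  # 0.4
```

Passing the full gene sequence to the same function would give the whole-gene GC fraction for comparison with the reported 33%. That comparison has not been run here.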
 
Protein sequence
MTYSETSEQY YISEDVKKRI QEAKYQKLKW LYLSGCKLTE VPGDVWELEQ LEVLDLGSNE 
LTSLPESIGK LSNLTSLYLV NNKLTSLPES ITKLSNLTEL YLDGNQLTSL PESITKLSNL
TELYLSVNKL TSLPESIGKL SNLTSLDLGG NQLTSLPESI TKLSNLTELY LGHNQLTSLP
ESITKLSNLT ELYLGHNQLT SLPESITKLS NLTSLDLSWN KLTSLPESIT KLSNLTSLYL
GSNQLTSLPE SITTLSNLTV LDLGSNQLTS MPESITKLSN LTELYLDGNQ LTRLPESITK
LSNLTKLDLR NNQLTRLPES ITKLSNLTKL NLSWNKLTSL PESIGKLSNL TSLYLRDNQL
TILPESITTL SNLGWLYLNN NPLENPPIEI ATKGIQEIRD YFQQEREKGI DYIYEAKLLI
VGEGGAGKTT LANKILDQNY QLKDEDTTKG IEVHQYKFQT KNQNDFQINI WDFGGQEIYH
ATHQFFLTKR SLYTLVADTR KEDTDFYYWL NVVELLSGNS PLLIVKNEKQ ERKREINQRA
LQGQFTNIKE VLATNLKTNR GLEEIIREIE HHISKLPHVG SRLPKTWKQV REILELDSRN
YISLEEYLSI CEQNGFEKRE YKLQLSGYLH DLGICLHFQD DPLLNKTVIL KPEWGTAAVY
KALDNQTVRN NFGEFTKDDL ANIWNEEKYV NMRDELLQLM IRFKLCYKIY GNSQTYIAPQ
LLTENQPEYD WDESNNLILR YTYEFLPKGI ITQFIVAMHK DIEEQKYVWK SGVILKKNET
RAEVIEYYNI REIKIRVSGR EKQYLMTIVT YEFDKIHSSY NNRLKYNQLI PCNCNVCQNN
QNPTSYKFEI LKNRINNGKE TIECDYPPFY EVNIKSLIDD VINISEYDKL SQSNSHPNNV
YNKYYSYTNY GGDWNIDAKD TNSQIMGNKT MNKNIDKSRK IENKDGNITG NVLGDESNIE
GNITTHSPAP KTEKPDKFGW LDFLTWAQNG QMIFYLASAT IGIFVLIIYP KLFPSGFPKL
IEKVQEIFPA PQEEIESEQN Q
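
The protein sequence follows from the gene sequence by translation with table 11. As a minimal sketch, the Python below builds the codon table from the NCBI base order (T, C, A, G) and translates in frame from the first base. It assumes an unambiguous A/C/G/T sequence, and it relies on table 11 sharing its codon assignments with the standard code. The alternative start codons of table 11 are not handled, which does not matter for this gene because it starts with ATG. The name `translate` is illustrative only.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as printed in this record.
print(translate("ATGACTTATT CTGAAACAAG CGAGCAGTAC"))  # MTYSETSEQY
```

The output matches the first ten residues of the protein sequence. The final codons of the gene, CAA TAA, give the closing Q followed by the stop.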