Gene Tery_4781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4781 
Symbol 
ID4246435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7344438 
End bp7346654 
Gene Length2217 bp 
Protein Length738 aa 
Translation table11 
GC content32% 
IMG OID638109630 
ProductTPR repeat-containing serine/threonin protein kinase 
Protein accessionYP_724206 
Protein GI113478145 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.409511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0812036 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGACTA TGGATACTCA GATAAGCAAA ATTGTAGATC AACGGTATAG CTTGATCCAA 
CCTATAGGTT TAACTATTTT AGGACAGACA TATTTAGCAC TAGATACTCA CCGTCCTGGT
TCCCCACAAT GTGTGGTTAG AGAAATTCGA TTAGGTAATT TTACCTCAGG AAATAGAGAT
CTAATACTAT CATTATTTAA AGAAAAAATA GAAAATTTTT TTAGCTTGAG TCAACACGAG
GGCTTGCCAC ATTTATTAGC TTATTTTGAA GAAAACAATA ATATATACTT AGTTGAAGAC
TATATAGTAG GTATATTGTT ATCAGAAGAA TTAGCTGGTG GTAAATGCTT AATAGAATTG
GAAGTAATCA AGATTTTAAA AGAGATACTA GAAATTTTAG TTTTCATTCA TAATAAGGGG
GAAGCTCATG GAAAAATTAA ACCAGGAAAT TTGGTCAGAC AAGTATTAGA TGGAAAATTA
TTCTTAATGA ATTTTGGGTT AGAAAGGCAA ATTAAGAGAA TTTTAGAATT AAATGAGGAA
CCATTAATAT CAGAAAATTA TGGAGATAGC AATTATTATA GCTTACTTTA TATCCAATCG
GAAAAAAATC AAGAGCAAGT AGACAAAAAA AGTGATATTT ATGCTTTAGG GATAGTTGCT
ATTCAAGCAC TAACAGGATT ATCACCTCAA GATTTACTGG AGCAAAAAAT AATTAACAAT
ACTCTAAGAA TAGAAATACC TTGGCAGAGT TTGCAAGTAT GTTCCCTACC ACTATCAAAT
GTGATTGACA AAATGGTAAA TAATCAATGT GAAGAAAATT ATGAGTCAGC AACAGAAGTC
CTAGCAGAGT TAAGTAAGAT ATTGGTATCT ACTGAAACCA AATTAATTCC ACAATCAGAA
ACTATAATTA CAAGACCTAG AGCAAGAGAG ATTCCATTAC TACCAACATT ACCCAGATAC
AAGAAAAAAA ATATATTAAT TGCTAGTTTA GTAATAATAA CAGGAGTAGC GATCGCTTAT
TTCTGGCAAT ACCAATTTAT AGGGCAAACA CTCTATAAAA AAGGTCAAGA GTTAGCTAAA
CAAGGAAAAC AGCAAGCAGC GATCGCCAAC TACACTGAAG CACTCAAACT CAATCCCAAA
AAGGCATCAA TTTACTATAA AAGAGGAAAC TCTTACTATT CTCATAGATC TTATGAAAAG
GCAATAAAAG ATTATACAGC AGGGATTAAA ATCAAAGCTA ACTATGAAGA TGCTTACTAT
CAACGTGCTC TAGTTTATTA TGAACTAGAT AACAAAGACA AAGCTATGAC CGATTTAACG
CAAACATTAA GAATAAATCC TAACTACACC CAAGCTTACA AAAAACGAGG GCTAATTTAC
TACGAAATTG GAGACTATAA AAGTGCTATC CAAGACTACA GTGAATCAAT TAGACTTAAT
CCCAAAGATA GTAAGACCTA TATTAATAGG GGAATAGCAA GAGGAGCGCT AGAGGATCAA
GTCGGTGCAA TTAGTGACTA CACTCAAGCT ATCAAACTGA ACCCCAATGA TGTAAAAGCA
TATTATTATC GAGGTAAGTC TCTGTTTAAG ATGTTAGACT ATCAAGGTGC AATAGAAAAC
TATAATCAAT TCTTAGAAGT TAAACCTGAT GATGCCGATG CTTATACAAA TAGATGTAGC
GCTTACCTCC ATAAGGGAAA TGATTCATCA GCAATAGCAG ATTGTCAACA AGCAATAGAA
ATAAATCCTC AAGATTTTCT AGCATATCAT AATCTATGTA TTGCCTATTT CAATCTAGGG
GAATATCAAA GGGCAACAGA AAACTGTAGT ATTGCTATTG GTATAGATAA GAACAATGCA
AAAGCTTATA CAAATAGAGC TTTAGCTCAG TCTGCTCGTG GCTATTTACA AGAAGCGATC
AAAGACTTTA CTACAGCAAT TGAAATTAAC CCTCAAGATG ACCTTAACTA TAGTCATCGG
GGAATGATTT TCTCAGTTCT CAAAAACTAT AATCAAGCAA TTAAAGATTT TTCTCAAGCA
ATTAGACTCA ATTCAAATAA TGCTAAAGCC TATTATAATC GAGGAGTTAT TCTACATAAA
TTAGAAGACT TACCAGCAGC AATAGTTGAC TTTAATAAAA GTGCTAGTTT ATTCTTAAAA
CAAGATCAAA TAAAAAATTA TCAAAACTCG CTAAATATGA TTAAAAAACT ACAGTAA
 
Protein sequence
MGTMDTQISK IVDQRYSLIQ PIGLTILGQT YLALDTHRPG SPQCVVREIR LGNFTSGNRD 
LILSLFKEKI ENFFSLSQHE GLPHLLAYFE ENNNIYLVED YIVGILLSEE LAGGKCLIEL
EVIKILKEIL EILVFIHNKG EAHGKIKPGN LVRQVLDGKL FLMNFGLERQ IKRILELNEE
PLISENYGDS NYYSLLYIQS EKNQEQVDKK SDIYALGIVA IQALTGLSPQ DLLEQKIINN
TLRIEIPWQS LQVCSLPLSN VIDKMVNNQC EENYESATEV LAELSKILVS TETKLIPQSE
TIITRPRARE IPLLPTLPRY KKKNILIASL VIITGVAIAY FWQYQFIGQT LYKKGQELAK
QGKQQAAIAN YTEALKLNPK KASIYYKRGN SYYSHRSYEK AIKDYTAGIK IKANYEDAYY
QRALVYYELD NKDKAMTDLT QTLRINPNYT QAYKKRGLIY YEIGDYKSAI QDYSESIRLN
PKDSKTYINR GIARGALEDQ VGAISDYTQA IKLNPNDVKA YYYRGKSLFK MLDYQGAIEN
YNQFLEVKPD DADAYTNRCS AYLHKGNDSS AIADCQQAIE INPQDFLAYH NLCIAYFNLG
EYQRATENCS IAIGIDKNNA KAYTNRALAQ SARGYLQEAI KDFTTAIEIN PQDDLNYSHR
GMIFSVLKNY NQAIKDFSQA IRLNSNNAKA YYNRGVILHK LEDLPAAIVD FNKSASLFLK
QDQIKNYQNS LNMIKKLQ