Gene Tery_4718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4718 
Symbol 
ID4246372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7239247 
End bp7242270 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content34% 
IMG OID638109578 
Productextracellular ligand-binding receptor 
Protein accessionYP_724154 
Protein GI113478093 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.87534 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATGA CATCTACAGA AAATTGGCGA AATCCTTATA ATATTGGTGG TTCTATTGAT 
GAACCAAAAT TGTTTTTTGG TCGAGAAAGT TTGTTTCGTG TTATTGAAGA TAATTTGAAT
AATAATCAAC GAATTATTCT TCTACATGGT CAAAGACGTA TAGGTAAATC TTCTGTATTA
AAGCAAATTC CTAAACAGGT ACGTTTGGAT AATCAATTTG TTTTTATTCT GTTAGATTTT
CAAGATAAAT TCCAATGGTC TCTCGATCAA ATTTTCCATT ATATTATCCA AAAAGTTGCT
CAAGAAATTT TTGAGAATTC GGAAGATACC ACACATAGTA TTGATTTAGT TTCACTTGAA
GGCTTAGAAA AAGATCCCAA TGAATTCAGA GAATTATTGC ATTCGATAGT CCAAAAATTG
GGGAGTAAAA ATTTAGTATT ATTACTGGAT GAGTTTGATG TATTTGCAGG CAACAATAAT
GACTCAACAT TTGAAGATTT CTTTGGATAC TTAAAGACAA TTGTGTCTGA GGAAAAGCAA
CTTTTTATTA TTCCTGTTGT GGGAAGGCGA CTAGATGATA TGCCGAAACT TCTAAGATTG
TTTAAGGGTG CGCCTAAGCA GGAAATAGGA TTACTAGAAA GAGTAAGTAC GAAAAGGTTG
ATTACACAAC CTGCTAGAAA ATTTCTGCAA TACAATGAGG GAGCAATAAA CGAAATTTTT
CGACTTTCAG CGGGCCATCC TTACTTTACA CAAGCTATAT GTTACGCTGT GTTTGTACAA
GCACGGGAAG AGGAAAAAAG TGAGATTTTG GAGAGTGATG TAGGCAGGGC GATCGAGAAG
GCTATGGAGT TAAGCGAAGG AGGGTTGGAT TGGTTTCGGG GAGGTTTGCT GAGACTAGAA
AGAGTTTTAT TTTCAGCAGT AGCAGCAGCT CAAGAGAGTG TTAAGCAAAT TCAGTCACCT
CCGGAAAACT TCTTCAATTT GCTTGAAAGA TATGGAGTTC AAATAAAACA ACTACTTCGT
GAGCAACTAC TTCAGGCACA ACAAGCTTTG AGTGAAAATG ATTTTTTAGA TCCATCTGGG
TACAAAGTTA CAGTTGAGTT TGTTCGTCTT TGGTTAATCA AATATTATCC ACTACGCTCA
GAAATATGGG AATTGGAGAA GCTTGATGAT GAAGCTAATA ATTATTACGA AGGAGCTGAT
AAATGGCGGA TACGAGGGGA TATAGATCGT GAGTTAGAGC ATTATAATAT GGCTTTAGAA
CTTAATCCTA ACCACTTTAG TGCTTTATTC CGGTTGGCTG CACGATACAA AAAAAAGCAA
AGATTTCAGG CAGCATTAGA ACTATACGAG AGAGTTTATA AAATTCATTC ACAACGTGGG
AAAGAGGAAT ACATAGAGTT ATTGTTTGGT TATGGTGATC ATTTAATTCA AGAAAATGGT
TCTATAGAGA CAAAGTTAAG TGCAGTTAAA AAGCTATACG AAACGGTCCT AAAAATCGAA
CCATACAATA TGGAAGCTCA AGAGAAACTT AAAGATATTA AAGACAAGGA AAATGCTCTA
TTTTCTGTCA GTAAGAAGTC TAAAACAATT CCAATTCGTA ATGTATTTTT GACAGCAGCT
TTAGCAGCTC CTCTGCTGAT TGGAATAGGA ATATTTTGGG GATTAAAAAC TAAACAAGAT
CCAGATTTTA TACTCGGAAA ATCAGAGCTG GATCAGCTAG AAATAGAAAG ATTTAGCAGT
GGCGAAAAAA GAATGTTTGA TTGGTATAAC AACACAGATG ATAATATTAA GTTTATAAGT
TGTAATGTAA AATTTAAAGT TTCTAATTAC AAAGAAGCAG TTAGTTGTTT TCAAGAATTT
GTAAATTCTA ATCGCAACGA CCCAGAGCCA TTGATTTACT ATAACAATTC TCTAGCTCGC
GAGAATAATA ATAATCCTTT AAAAATAGCT GTAGTTGTAC CAGCAGATAA GAACAGTCAA
AGAGCTAAAG CAATTTTGCG AGGTGTTGCT CAGGCACAAA ATGAATACAA TAAAAACTAC
TATTTCTCAG GAAATAGTAG ATTACTGGAA ATTATTATTG TCAATGACAG TAATGATGAT
GAAATATCTC CAAAAGTAGC TCAAGGAATA GTCAGAGATA AAGAAATTTT AGGGGTAATA
GGACATAATT ATAGTGATGC TACTAAGGCA GCATTAGAAG TATATCAAAA AGAAAAATTG
GCAGTTATTT CTTCTACTAG CACAAGTATA GAATTAAAAG GTGATACTTT TTTTAGAACG
GTGATTGATG ACTCTGTAGC AAGTAAAAAG CTAGCTGAAT ATGTCAAATT TCGATCTATA
GAAAAGATTG TGATTTTTTA TAACAAAGAA AGTTCATTTA GTAAAAGTAT CAATGGTTTT
TTTGATTTTT ACTTAACAAC TTTGAAGCCT GACATAAAGG TTAAAAGTAT AGATTTGAAA
CAATCGTCTT TTGATCTCAA TAAGGAAGTT CGGGTGGCAG CCAATAACCA AGTTGAAGCT
GGAGCGCTAT TTGCAAACAT AGGAACAATT GACTTAGCAA TAGAGGTTGC TAAAGCTAAT
TTTAAATTGC CAGAAAGCCA AAGATTAAAA TTAATTGCTG GTGATAGTTT TTATAATTGT
GATATCTTAG ACAAAGGTGG GGAACCACTT AAAGGTTTGA TTTTATCAAT ACCGTGGCAT
AAACAATTAG ATACAGCAAA AGAGTTTGTA GCTAGAGCAA AAGAGCAGTG GGGAGAGGAG
GAGGAGGTTG GCTGGCGCAC TGCTACTAGT TATGATGCAA CTAAGGCTTT TATTTCTGCG
TTATCTAATT CTGGCGATGA TCCCACCAGG TCTAAAGTGT TGGAAAAACT TAAAGAGGTA
AATGTTCCAG CTAATGAAAC CTCTGGAGAA CATCTTAAAT TTAATCCAGA AGGAGAAATT
GCTGGTCAAG CAGTTCTTGT GGAAGTTGTG GAGTCTCAAA ACCCAGCTTG CTCTAGTTTA
GATTTTAGTT TAATAAAAGA GTAG
 
Protein sequence
MNMTSTENWR NPYNIGGSID EPKLFFGRES LFRVIEDNLN NNQRIILLHG QRRIGKSSVL 
KQIPKQVRLD NQFVFILLDF QDKFQWSLDQ IFHYIIQKVA QEIFENSEDT THSIDLVSLE
GLEKDPNEFR ELLHSIVQKL GSKNLVLLLD EFDVFAGNNN DSTFEDFFGY LKTIVSEEKQ
LFIIPVVGRR LDDMPKLLRL FKGAPKQEIG LLERVSTKRL ITQPARKFLQ YNEGAINEIF
RLSAGHPYFT QAICYAVFVQ AREEEKSEIL ESDVGRAIEK AMELSEGGLD WFRGGLLRLE
RVLFSAVAAA QESVKQIQSP PENFFNLLER YGVQIKQLLR EQLLQAQQAL SENDFLDPSG
YKVTVEFVRL WLIKYYPLRS EIWELEKLDD EANNYYEGAD KWRIRGDIDR ELEHYNMALE
LNPNHFSALF RLAARYKKKQ RFQAALELYE RVYKIHSQRG KEEYIELLFG YGDHLIQENG
SIETKLSAVK KLYETVLKIE PYNMEAQEKL KDIKDKENAL FSVSKKSKTI PIRNVFLTAA
LAAPLLIGIG IFWGLKTKQD PDFILGKSEL DQLEIERFSS GEKRMFDWYN NTDDNIKFIS
CNVKFKVSNY KEAVSCFQEF VNSNRNDPEP LIYYNNSLAR ENNNNPLKIA VVVPADKNSQ
RAKAILRGVA QAQNEYNKNY YFSGNSRLLE IIIVNDSNDD EISPKVAQGI VRDKEILGVI
GHNYSDATKA ALEVYQKEKL AVISSTSTSI ELKGDTFFRT VIDDSVASKK LAEYVKFRSI
EKIVIFYNKE SSFSKSINGF FDFYLTTLKP DIKVKSIDLK QSSFDLNKEV RVAANNQVEA
GALFANIGTI DLAIEVAKAN FKLPESQRLK LIAGDSFYNC DILDKGGEPL KGLILSIPWH
KQLDTAKEFV ARAKEQWGEE EEVGWRTATS YDATKAFISA LSNSGDDPTR SKVLEKLKEV
NVPANETSGE HLKFNPEGEI AGQAVLVEVV ESQNPACSSL DFSLIKE