Gene Tery_2769 details


Gene Information

Locus tag: Tery_2769
Symbol:
ID: 4244802
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4291521
End bp: 4294466
Gene Length: 2946 bp
Protein Length: 981 aa
Translation table: 11
GC content: 38%
IMG OID: 638107828
Product: extracellular ligand-binding receptor
Protein accession: YP_722425
Protein GI: 113476364
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
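
The coordinate and length fields in this record are linked by simple arithmetic. Assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the listed gene length), the span covers End bp - Start bp + 1 bases, and an unspliced CDS of that length encodes one residue per codon minus the terminal stop codon. A minimal Python sketch of that check follows; the function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_length = span_length(4291521, 4294466)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 2946 981
```

Both results agree with the Gene Length and Protein Length fields listed in the record.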


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.394148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGGAA TTTCTCCTGA ACTCTATTCT AAACTGAGTG CATCTCTGTC AAGGTGTGAG 
CAATTTAACA GCAATGAAGA ACTACAGAGT TTTTTCAACG GTCACCGTCA GCTCTCCCTT
TGGGCTAATA GTTTACCACA AGGTATCAAT ACGGACCAAC GAGTAACACA CGTGATTGGC
TTTTTAGTGG GTAAATATCA TACCAATAAG AATGGACTCG TTATTTTGGT AAGGTTGCTT
TGTGAGACGT TTGACCGAGA AGATAGTCGG CATAGCACTT TATCTAAATT AGAAAAAGAG
CTTAATAAAC AGCTGAATAG TGTTGATAAC AAAACTAGCT TTAATTGGTT ACACTTAACC
GACTTTCATC AAGTCATAAA AGAACAGAAT GGGGGATCAT CTCGCGTAAA AAAAAGATTT
TTTGAAGATT TAAACAGGCT CCACGAGAAA AGTGAGCCGT GGGACTTAGT GCTATTTACT
GGTGATTTGA CTCAGGAGGG TAGTGCTGAA GAATTTGAAA AGCTCGATCA GCTTTTGGAA
CAACTATGGG CAAAATTCCA AGAATGGGGT TCGTCTCCTA AGCTACTAGC AATACCTGGA
AATCACGATT TAGTTAGACC TAATCAAAAA GAACCAGCGG TAAGACTTCT AAAACGGTGG
TCTGAGGAAC CAGAGGTACA ACAAGAATTT TGGGAGGATG CAAAATCAGA CTATCGTCTA
GTCATGGCTG AAGCTTTTGC AAATTATATG GCTTGGTGGA AAAGACAACC CTGGAAGCCA
GAACACCTAA AAGCTGGTAT TCTACCAGGA GATTTTTCAG CCACTATCGA AAAGAATAGT
GCAAAACTCG GAATTTTGGG TTTGAATACA AGCTTTCTTC AGCTTATTGG TGAAAATAAT
GAGGGCAAAC TGGCTATTCA TACCCACCAA TTTCACCAAG CTTGTGACGA CAATAACACC
CAAAATTGGG CTAGAAAACA TCGTGCCTGT TTGTTACTGA CCGATCATCC TGTCGATGCT
CAAATGCACT TGAATACAAA AATTACTGCC GGTTACCCCT TTGTCCTTTA TCTACGTGAC
CATACTGATA CTAAAGCTAA ACAAATTTGG CATTACTATA CAGTTAATAA AATTGAACTT
GAACTCATTA AAGGCAAGGA TCAGTTGAAA TCTTGGTTTC TTGAAGCATC CCGAGAAGGG
GGAAAAATCC AAGAATGGGA AATTCCTATT TTCAATGATC GTCTTAATTC AATACCATCC
AGACCCCCTG TGGGAAAAGA AGAGCTGATT GGAGATGAAC CTGGGCTAAC CCAAAATCTT
ACCAGGGGGA TAACTCTGTT TTTGTCCCAA AATCCTACCG GTTGGATAAT TTCGTTTTTG
TTGATTCTGA TTGATATTTT TTCAATTTAT CGCTGGTTAA TTTATCTTCC TAGACATGAG
CCTCTTGGTG ATAACATTAG TATAGGTGAG GAAATTCTGG TAGAATCATC TAGACCTCTA
GAAAAAGAAA ATGGTGTAAA AGAAGTTCAA TACTGCCAGA AATCCTGGAA TCATTTTCAA
GCTATTTGGA AAAATAATAC TAAAATACAG AATTGTTTTG CTGGAGTGGC AAATACTCTT
AAGGAAAGCT GGAAGAATGA AAGGCGGGAC CCAGAAACTT TAATTTATAT CAATAATGCT
TTTTTAGAAC ATATAAAAGC TGATCCTTAT ACTATTGCAG TTGTAGTTCC AGTTATAGAT
CAAAATGAGC AGGTGAATGG AGAACTTGCT GAAGAAATTT TGCGGGGAGT AGCTCAAGCA
CAAACAGAAG TTAACTTAAG TTTATTCAAG AAAAAGCAAT TCTCTTTTTT GCCTTTTCCA
AATGTAGATC TAAAAGCCAA AAGGTTTAAT GATAATGATA ATGATAAAGG CTTAAAAGTT
ATTATTGCTA ATGATGCTAA CTCAGAGGAT GGCGCAGAGA ACGTTGCGAA GAAAATAGTC
GGCCGCCCAG AAATTTTAGG TGTTGTTGGT CATTGGGCTA GTGAGATGAC TATGGCGACT
AAAGATATAT ATGATGATGC AAAACTAGTA ATGGTTTCCC CAGGCACAAC CACTTCTAAA
CTCACCGCCG AAGAAAGGGT AGACGTTTTT TTCCGGACTA CCACAACGAC TATTGAGCAA
GCAGAGAACA TGGTTAACTC TTTGCTGAAT AAAAATCAAA CAAGAGTTGT AATTTTTTAT
AATCCTAAGA GTTATTATTC CGCTGATTTA AAAAAACAAT TTGAAGAAAA ATTTGAAGGA
AAAGGAGAAA TAATTAATTT ATCACTTAAT ATGGATTATT TTGCTGAGGA TAATTTTAAG
GTTAAAGATG CCATCAATGA GGCCCGCAGA CAAGCAGGAG ATCAGGAATT CGCAATTGTT
TTAATATCAG ATGGTCAAGT AAGTGATGCT TTCGATAATA GTCTCAAAAT TATTGAAGAA
AATGGTGGTC AAAATTGGAT AGTAGCTAAT TGGTCAGTTT ATAGCCCGAG AACCTTGAAA
ATTGCTCAAG ATCAATCCCA AGAAACACGA TATCAACTCC TAGAAAAACT GATTTTGATT
GTTCCTGGGC ATCCTCTCAA CAATTCTGAT TTTTTTGATA CCGCTGTCAA ACTTTGGGAA
GGTTATGTTA GCGCCCGTAC AGCTTTTAGC TATGACGCAA TGCAGGTAAT TCTTAAAGGT
ATTCAGGAAC AAGGTACTCG CCCTACTAGC AAAGGAATTC AGAAGACATT GGCAGATGAA
AATTTCATCG TTCAGGGGGC TACAGGAGAA ATTATATTTA AATCAGGGAC TGGCGATCGT
CAAAAAGTAC CGTTGAACTC AATTCGGGTT TATCGTTGTC CGAGTCAGCC GTCCGGTTTC
ATGTTTATTC CTGACAAATT TTCAACACCT GAAGAAGCAA AGGTGAAATG CTCGAATTCT
GAGTAG
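
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any computation. The sketch below, in Python, shows one way to do that and to compute a GC percentage as the share of G and C bases in the sequence. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGGGCGGAA TTTCTCCTGA ACTCTATTCT AAACTGAGTG CATCTCTGTC AAGGTGTGAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full 2946 bp sequence, the result can be compared with the GC content value listed under Gene Information.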
 
Protein sequence
MGGISPELYS KLSASLSRCE QFNSNEELQS FFNGHRQLSL WANSLPQGIN TDQRVTHVIG 
FLVGKYHTNK NGLVILVRLL CETFDREDSR HSTLSKLEKE LNKQLNSVDN KTSFNWLHLT
DFHQVIKEQN GGSSRVKKRF FEDLNRLHEK SEPWDLVLFT GDLTQEGSAE EFEKLDQLLE
QLWAKFQEWG SSPKLLAIPG NHDLVRPNQK EPAVRLLKRW SEEPEVQQEF WEDAKSDYRL
VMAEAFANYM AWWKRQPWKP EHLKAGILPG DFSATIEKNS AKLGILGLNT SFLQLIGENN
EGKLAIHTHQ FHQACDDNNT QNWARKHRAC LLLTDHPVDA QMHLNTKITA GYPFVLYLRD
HTDTKAKQIW HYYTVNKIEL ELIKGKDQLK SWFLEASREG GKIQEWEIPI FNDRLNSIPS
RPPVGKEELI GDEPGLTQNL TRGITLFLSQ NPTGWIISFL LILIDIFSIY RWLIYLPRHE
PLGDNISIGE EILVESSRPL EKENGVKEVQ YCQKSWNHFQ AIWKNNTKIQ NCFAGVANTL
KESWKNERRD PETLIYINNA FLEHIKADPY TIAVVVPVID QNEQVNGELA EEILRGVAQA
QTEVNLSLFK KKQFSFLPFP NVDLKAKRFN DNDNDKGLKV IIANDANSED GAENVAKKIV
GRPEILGVVG HWASEMTMAT KDIYDDAKLV MVSPGTTTSK LTAEERVDVF FRTTTTTIEQ
AENMVNSLLN KNQTRVVIFY NPKSYYSADL KKQFEEKFEG KGEIINLSLN MDYFAEDNFK
VKDAINEARR QAGDQEFAIV LISDGQVSDA FDNSLKIIEE NGGQNWIVAN WSVYSPRTLK
IAQDQSQETR YQLLEKLILI VPGHPLNNSD FFDTAVKLWE GYVSARTAFS YDAMQVILKG
IQEQGTRPTS KGIQKTLADE NFIVQGATGE IIFKSGTGDR QKVPLNSIRV YRCPSQPSGF
MFIPDKFSTP EEAKVKCSNS E
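
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. As a minimal sketch (not the database's own pipeline), the Python code below builds the codon table from the compact NCBI layout and translates up to the first stop codon. Table 11 shares its codon-to-residue assignments with the standard code and differs only in the start codons it permits; since this gene starts with ATG, no special handling of alternative starts is included. The name `translate` and the use of "X" for unrecognized codons are choices made for this example.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (NCBI layout); "*" marks a stop codon.
# These assignments are shared by the standard code and translation table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for codons with ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGGCGGAA TTTCTCCTGA ACTCTATTCT"))  # MGGISPELYS
```

The output matches the first ten residues of the protein sequence, and the final six bases of the gene (GAGTAG) translate to the closing E followed by a stop codon.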