Gene Tery_3417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3417 
Symbol 
ID4244454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5229829 
End bp5231175 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content33% 
IMG OID638108397 
Productpentapeptide repeat-containing protein 
Protein accessionYP_722987 
Protein GI113476926 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.462451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.146874 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAACC AAAAATCACT TGATAAATTA CAGAAACTAA TTAAAGATAC TATTAATCAA 
GCAATAAATA ATGGGGATCA AGCTAGCCAT CAAAAATTGA CTAACAATTT TGCTGGTGTC
AACCTTGAGT CCGTTGACCT GAGTATGAGT GATCTAGATG ATGTTAATTT GAGTGAAACT
ATTCTCCGGG GTGCTTATCT ACTTTGTGCT AATTTGAAAA GAACTAACTT TAGTAATAGT
GATCTGAGTG GTGCTAATCT AAGTGGGGCA ATAATGTGGT TTGCTAACTT TAGTAAAGTA
AATTTTAAAG GGGCAGATAT CAGAGGTGCT AGTTTCAAAA GTGCTAACCT CAAAGGTGCT
GACCTAGAAG GTGCTAACCT CTGGCGTAGT GAGATTACAG ATGCATACTT AATTCAAGCT
AATTTATTAT ACGCTAATTT AATTCGTGCT AATCTTAGAA ATAGTAATCT GACAAATGTT
AACTTGAGTT ATGCGGAACT TAATGATGCA AACCTCAACG AAGCTAATCT TATAGGTACT
AATTTTAGTT ATGCTAATTT GAGTAATGCT ACTCTTAAAG ATGCTAATCT CAAAGATTCT
AATTTATCTA ATGTTAATCT TGTTGGAACT CAATTAAATG GAGCTAATCT TGAAGGTGCT
AATCTTGAAG GTGCTAATCT TATAGGTACT AATTTTAGTG ATGCTAATCT TAATTACACA
AAGTTGAGGA ATACTAATTT AAATCATGTT AATTTAAGAG GGGTGAAAAT TAATCATGGA
ACAGAATTAG ATAATAAGTG GTATCTGGTT TGGGATATTA TTAATCATGG TGCTTATGGG
AGAAATTTAA GTGGTGTTCA TCTGGAAAAT GCTGATCTTA AGGGTGCTAA TATCGGCACT
GCTAATTTAA CTAATGCTAA TCTGGAATAT GCTAACCTCA GATATGCTAA CCTTAGTAAT
GCTAATCTCG CTCATATTAA ATTAACTAAT GCTAATCTAG CAGATATTAA TTTGATTAAG
GCAAATTTAT ATAGTGCAAA TATGCAAGGA GCTAACCTGA GCAATACTTT ATTATTTAAT
TCTATTATGA CGGGTGCTTT ATTAAATCAA GCTAAGCTGC TAAAAGCACA ATTGTGTGAT
GCGGAATTAT CAAATGCAAA GTTAATTATG GCAAATTTAA TGAACGCTAA CTTGAAAAAT
GCTAGGTTAT TAGGGGCTGA TTTAAGGAAG ATAAATCTAG AAGGTGCAGA ATTAGATGGT
GCTATATTTG GTAATAATAT GGGAGTTTCT GAGAAGATGA AGCAGGATTT AATTAAACGT
GGGGCTGTTT TTAAAGATAA GTTTTAG
 
Protein sequence
MANQKSLDKL QKLIKDTINQ AINNGDQASH QKLTNNFAGV NLESVDLSMS DLDDVNLSET 
ILRGAYLLCA NLKRTNFSNS DLSGANLSGA IMWFANFSKV NFKGADIRGA SFKSANLKGA
DLEGANLWRS EITDAYLIQA NLLYANLIRA NLRNSNLTNV NLSYAELNDA NLNEANLIGT
NFSYANLSNA TLKDANLKDS NLSNVNLVGT QLNGANLEGA NLEGANLIGT NFSDANLNYT
KLRNTNLNHV NLRGVKINHG TELDNKWYLV WDIINHGAYG RNLSGVHLEN ADLKGANIGT
ANLTNANLEY ANLRYANLSN ANLAHIKLTN ANLADINLIK ANLYSANMQG ANLSNTLLFN
SIMTGALLNQ AKLLKAQLCD AELSNAKLIM ANLMNANLKN ARLLGADLRK INLEGAELDG
AIFGNNMGVS EKMKQDLIKR GAVFKDKF