Gene Tery_3476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3476 
Symbol 
ID4244476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5353427 
End bp5354902 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content28% 
IMG OID638108450 
Producthypothetical protein 
Protein accessionYP_723039 
Protein GI113476978 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTGC CTGTTATTTT AGATATAGTC ATTGGCTTAA TCTTTGTTTA CCTTACTTTC 
AGTTTACTAG CTAGTGAAAT TCAATCAATT TTGACAACTA TTTTGCAATG GAGAGCAGTC
CATTTAAAAC AATCAATTGA AGAATTAATA AGTGGTCAAA CTTATATTGA TAATGATAAT
TTAGAATCAG ATTTACACAT CAAAAAGGTA CAGCAACTAA CTAATTCTAT ATATAAAAAT
CCATTAATTA AAGATCTTAA TTACAGTGGT GGCAAAATTG GATTAGAAAG AATATTTAGA
AAATTTACCA ATAGAATTAG TGATTTAATT AGTGGTTTAA CTAGAGTTAA AAAAAATTTT
GACGGGGAAA CAACAGCTCC GTCTCAGATT CCATCAAATA CATTTGCTGC TAGTTTAATT
GATACTTTAA AAATTCCAGA TTTAATTAAA ACAATTAGTA ATTATCAACT CGAAGATTTT
ATCAATAAAA AATTGATACA AATTAATAGT ATTTTAAATG ATCTGGATTT AAAAGATGCT
ACCAAAAATG AAATGAAAGC TAATTTAAAT CATTTATATC AAGAGTTAAA TGTTATATAT
AATAAATATA AAAAAAATGA TTTAGCAAAT ATAAACCTGA CTTTAGACAG AATATTAAAT
CGGTTAAATT TATTTATAAA AAAATCTTTA CAAGATTTGC CACAAATAAC CCATTGTAAA
TTTGAGGAGA AAATGATTTT AGTTAAGCAA GTATTAACAA ATCCCCAAGA GAGAGAAGTT
TTAGTCAGTG AGATTGAACC AAGTTTCTCT AATTTACTAG AGTTCTGGAG AAGGTGGACC
GAAATGGCAA AAGTTTCTCA AGTAGATTTG CAAAGTAGAA AGGGGAAAAT ATATAAAAAA
GTTGAGGAAA CCCTGGAACA GCTACCTGAA TCTTTACAAA ATAGTTTATA TATTTTAGCT
CAACAAGCTA CAACTAAAAT TACTACAACA GGTGATACTT TAAATCAATT TCAAAAAGAA
GTTGAGCAAT GGTTTGATAA TGGGATGGAA AGAGCTGCTA ATGTTTACAA GCGTAATGCT
AGGGGAGTTG CTTTTTTGCT TGGTATTACT ATTGCTATAG CTACTAATCT TGATACGTTA
AATTTAATCG ACCATTTATC AAAAGATTCG TTGATGCGAG CAACTATTAA TTATTATTCT
CAGGAGTTAA TTAATAATAG TTCTAATTCT GATGAGATGG ATATAGAGAA TATTCAAAAT
CAGGTTAATG TTGCATTAGA TGATGTGAAA TTACCTATTG GTTGGGGGAA TGATGTACTA
ACTGATCAAG CAGTAGAAAA TCAATCATCT GATTACTTGA AATGGTTAAA AAGATTATTG
GGTTGGATAA TTAGTGGAAT AGCAATATCT ATGGGGGCTG ATTTTTGGTT TAATTTATTG
AAAAAGATTA TTGAAGTAAA AAATGTGAAA AAATAA
 
Protein sequence
MNLPVILDIV IGLIFVYLTF SLLASEIQSI LTTILQWRAV HLKQSIEELI SGQTYIDNDN 
LESDLHIKKV QQLTNSIYKN PLIKDLNYSG GKIGLERIFR KFTNRISDLI SGLTRVKKNF
DGETTAPSQI PSNTFAASLI DTLKIPDLIK TISNYQLEDF INKKLIQINS ILNDLDLKDA
TKNEMKANLN HLYQELNVIY NKYKKNDLAN INLTLDRILN RLNLFIKKSL QDLPQITHCK
FEEKMILVKQ VLTNPQEREV LVSEIEPSFS NLLEFWRRWT EMAKVSQVDL QSRKGKIYKK
VEETLEQLPE SLQNSLYILA QQATTKITTT GDTLNQFQKE VEQWFDNGME RAANVYKRNA
RGVAFLLGIT IAIATNLDTL NLIDHLSKDS LMRATINYYS QELINNSSNS DEMDIENIQN
QVNVALDDVK LPIGWGNDVL TDQAVENQSS DYLKWLKRLL GWIISGIAIS MGADFWFNLL
KKIIEVKNVK K