Gene Tery_1337 details


Gene Information

Locus tag: Tery_1337
Symbol:
ID: 4242797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 2032658
End bp: 2034337
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 41%
IMG OID: 638106515
Product: FHA domain-containing protein
Protein accession: YP_721126
Protein GI: 113475065
COG category:
COG ID:
TIGRFAM ID:
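
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the listed 1680 bp) and that the coding sequence ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2032658, 2034337)
print(length_bp, protein_length(length_bp))  # 1680 559
```

Both values agree with the Gene Length and Protein Length fields of this record.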


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.172478
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTTCC CTTCAGAAAG TCCTTGCCTC AGCATAGCCC TCGCGCGCCT AAGCATCCCA 
GAAGCTAATC ACTTTGCCGT CTGGGTCATG CAGGCTCCTT TTCAGCGAGG ATATGTCCAC
CATGACCAGG TCTGGCCAGA AACCTTATCA AAAGCTTGGC AGACTTGGTT AGAGGTCTTC
TCTCCCCAAA GTTTACCGGC TATTCCAATA GGTAATTCTC AACCGACATT AGCATCAACC
CCAAATATTC ATTCTGTTTC TAAATCTGGT ATCAAATTAA ATCTTACCAG TCGCCTAATG
CAAAATCTAG GTATTAATCT CTGGCAGTGG CTATTTCAAG GAGAAATTGC TCATAGTCTC
CACCAAAGTC AAGGTATTGC GATTGGCCAA GAGCTGCCCT TACGGGTAAG GTTAGATATT
CGGGAACCTG AATTAATTGC TCTACCCTGG GAAATTATGC AACCAGGAGT GGCTTTACCC
GCTTTTTCCC TAAGTCAGGA AATTTTGTTC AGCCGTACCA CCAGTGATGT TAACCCCTTA
CGCAACCAAG CACCTTCCCA ATGCCTAAAT ATTTTGTTGG TAATTGGGGA AAGTAGCCCT
AAGGGAAAAT CTTCTACAGG TAATGGCCAT AAATCTCTAG TTCTATCCAA ATTAAAACTT
GAGGAAGAAG TTGCCCAATT AATTGAGGTT TTGAAGGCTC GCAACATCAC AAACTCAAAT
ATACCTAGAG TTAATCCTAC TATTCCTTGT AGAGTGGATA CACTAATTCA GCCCACTCCC
AAAGAATTAA CTTCCTATCT AGATAAAAAA ACTTATAATG TAGTTTTTTA TGCTGGTCAC
GGTATACCAG GTCCAGATGG AGGTTGGTTA TTTTTAGCGC CTGATACAAC CCTCAATGGT
ACTGAGTTAG CACAAATTTT AGTGCGCAAT GGAGTAAGGT TGGCAGTATT CAATGCTTGC
TGGGGAGCAC AACCTGCTAC AGAGCGTCTA TCTTCTGGTG AGGTACAGGC AATACCACGC
AGTAGTCTGG CAGAAGTATT AATCCATCAT GGAGTACCTG CGGTTTTAGG GATGCGAGAT
GAAATTGCTG ATCGAGAAGC TTTAAGTTTT ATTCAAGTTT TTGCTCAAAG TTTGACGGAG
GGAATGTTGA TAGATCAGGC AGTAGTAATC GCAAGACAGC AGTTATTAAC TCTTTATAGG
TTTAATAAGC CAGCTTGGAC TTTGCCAGTA TTGTATATGC ACCCGGAGTT CAATGGTCAA
TTAGTTCAAG TATTTGATGA ATTAGTAACT CAACTACCTA CTAACTCTCA GACTTGGATT
AACGGTTACA CTTCTAAAGC TTTTTTACGT TCTCAAGATG ATAATAATCA AGTTTGGCCA
ATTTTGATTG ATCCAATCGC TGTTGGACGT TCTCAGGAAA ATGATGTGGT GATTTGGGAG
CGGTGGGTTT CCCAAAAACA CGCGGAAATT TTTTGTCGCT GCTTGCCGAA TGAAGAACTT
GAACCTACTT ATTTTTTACG AGATATTTCT CGTTTTGGGA CTTTGATTTA TCGGTCTGGT
ACTTGGCAAA GAATACATCG TGATCAGCTT GTTATAAAAT CAGGAACACT GTTAAAATTT
GGCAGTTCTC AAGGTCAAGT TTTTGAGTTT GTGATTGAAA CAACAGAAGA CCTAAGTTAG
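
The gene sequence is displayed in blocks of ten bases, so the spacing has to be removed before any computation. As a rough sketch (helper names are hypothetical), the snippet below joins the blocks and computes the GC percentage, shown here on the first displayed line only.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence (60 bases in six blocks).
first_line = "ATGCTTTTCC CTTCAGAAAG TCCTTGCCTC AGCATAGCCC TCGCGCGCCT AAGCATCCCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Passing all 28 sequence lines through `clean_sequence` gives the full 1680 bp gene, whose GC percentage can then be compared with the GC content field in the Gene Information section.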
 
Protein sequence
MLFPSESPCL SIALARLSIP EANHFAVWVM QAPFQRGYVH HDQVWPETLS KAWQTWLEVF 
SPQSLPAIPI GNSQPTLAST PNIHSVSKSG IKLNLTSRLM QNLGINLWQW LFQGEIAHSL
HQSQGIAIGQ ELPLRVRLDI REPELIALPW EIMQPGVALP AFSLSQEILF SRTTSDVNPL
RNQAPSQCLN ILLVIGESSP KGKSSTGNGH KSLVLSKLKL EEEVAQLIEV LKARNITNSN
IPRVNPTIPC RVDTLIQPTP KELTSYLDKK TYNVVFYAGH GIPGPDGGWL FLAPDTTLNG
TELAQILVRN GVRLAVFNAC WGAQPATERL SSGEVQAIPR SSLAEVLIHH GVPAVLGMRD
EIADREALSF IQVFAQSLTE GMLIDQAVVI ARQQLLTLYR FNKPAWTLPV LYMHPEFNGQ
LVQVFDELVT QLPTNSQTWI NGYTSKAFLR SQDDNNQVWP ILIDPIAVGR SQENDVVIWE
RWVSQKHAEI FCRCLPNEEL EPTYFLRDIS RFGTLIYRSG TWQRIHRDQL VIKSGTLLKF
GSSQGQVFEF VIETTEDLS