Gene Tery_2038 details


Gene Information

Locus tag: Tery_2038
Symbol:
ID: 4243642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3176638
End bp: 3178710
Gene Length: 2073 bp
Protein Length: 690 aa
Translation table: 11
GC content: 43%
IMG OID: 638107151
Product: FG-GAP
Protein accession: YP_721754
Protein GI: 113475693
COG category: [S] Function unknown
COG ID: [COG2340] Uncharacterized protein with SCP/PR1 domains
TIGRFAM ID:
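
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers in this record, but the record does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3176638
end_bp = 3178710
gene_length_bp = 2073
protein_length_aa = 690


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Here 3178710 - 3176638 + 1 gives 2073 bp, and 2073 / 3 gives 691 codons, that is, 690 residues plus one stop codon.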


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0466844
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAAC CAACTCCTCA AGACCAATAT ATGCTCGAAT TGGTAAACAG AAGTCGAGCA 
GATCCTCAAG CAGAAGCTGA TTTATATCTA GATGGAAACC TAAACGAAGG ACTTTCTGAA
GGTCAAATTT CCTCCGATGC CAAACAACCT TTAGCATTTA ATCTTAACCT CAATACTGCT
GCTAAAGGTC ATAGTCAATG GATGTTAGAT AATAACATAT TTAGTCATAC AGGAGCCAAT
GGTAAGAAAT CAGGAGACCG TATGCGGGAT TCGGGGTACA TATTCACAGG AACATATGGT
TCTGGGGAAA ATATAGCTTG GAGAGGAACA ACAGGTACGC CCAACTTCAC CACTTTTGTG
GAGAAAAACC ACGAAAACCT CGTCCGCAGT AACAGTCATC GGCTAACCCT GATGAGCAGT
AACCTTCAGG AAGTCGGAAT TTCTTCACTA CAGGGAGAGT TTACATTTGA GGGTATAAAT
TACAACACAG TGATGACTAC CCAAAATTTT GCTTATTCTG GAACGAGTGG CCCATTCATC
ACAGGTGTGG CCTATACTGA TGCTGTTAAG GATGACAATT TCTATACAGT CGGTGAAGGA
ATAAGCGGGA TAATAGTTAC AGCAGTTAAT ACTAATAATA GTAATAATAT TTTCACAACC
ACCACTTGGG ATGCAGGTGG TTATAGTTTA GATGTAGACC CAAATCAAAC TTACGATGTT
ACTTTTTCTG GGGATCTTAA TGGGGATGGT CAGGCAGGGG ATACAGTAAC TTATCAAGTT
AAAGTTAGTT CCGAAAATGT CAAATTAGAT GTAGTCAGTG ACAGCTTACC TACTCCGAAT
GCACCACCCA TAGCGGTAAA TGACACCACC AACACCAGCA AAGGACAAGC AGTCACCTTC
AGCATTACGG AAAACGACTC AGATACAGAC GGTACTCTAG AGCTAGCAAC AGTAGACCTA
GACCCATCTA CAGCCGGACG GCAAAACACT CTGACAGTAG CAAATGAAGG AACCTACACA
GTAGATAATG CTGGCAACCT TACCTTCACC CCCGAACCAG AATTTGCTGG AACTACGGCT
ACTATCACCT ACACAGTAGA AGATAATAAT GGTGAAGTCT CAAACCCAGC CGAAATAGGT
GTTACAGTCA TCCCATTCGA CCCATTAGAT CAACCAGTAA AGTTTGATTT TAACGGCGAT
GGAGTAGCAG ATATTCTTTG GCGTCGTGAA AATGGACCTA ACCGCATTTG GCTGATGAAT
GATAATGGTA CGCGTAAGAG TACGAAAAAC CCGGGAAATT TTGGAGCTGC ATGGGATGTG
GCAGGAGTGG GAGATTTTAA TGCTGATGGA GTAGCAGACA TTTTCTGGCG TCATAATAAA
AATAGAGGTA ACCGCGTTTG GTTGATGAAT GATAACGGTA CGCGTAAGAG TACGAAAAAC
CCGGGAAATT TTGGAGCTGC ATGGGATGTG GTAGGAGTGG GAGATTTTAA TGCTGATGGA
GTGGATGACA TTCTCTGGCG TCGTGACAAT CAAAAATTAA ACCGCATTTG GTTGATGAAT
AATAACGGCA AACGTAAACA ACTGGTTAAC CCGGGAAATT TTGGAGCTGC ATGGGATGTA
GCTGGAGTTG CAGATTTCAA TGCTGATGGA GTGGATGACA TTCTCTGGCG TCATAACAAT
GGACGGAACA GAATTTCGTT TATGAATAAT GACGGCAAGC TTGATAATAC AGTTAACCCC
GGAGGTTTAG GTTCAACATG GGATGTAGCC GGAGTTGCAG ATTTCAATGC TGATGGAGTG
GATGACATTC TCTGGCGTAA AAAAAATGGA ACTAACAGCA TTTGGTTAAT GAATGGTGAT
GGCACACATG ATGATATAAT TAACCCAGGG TCTTTCGGTT CAGCTTGGGA TGTGGCAGGA
GTTGCAGATT TCAATGCTGA TGGACTGACA GATATTCTCT GGCGCCATGA CAATGGAGCT
AACCGTATTT GGTTGATGGA TGATGATAGC ACCCGTGCTC AGAACCTTAA CCCTGGAGCT
TTCGGTTCAG CTTGGGATAT AGTTGGGATG TAA
 
Protein sequence
MTQPTPQDQY MLELVNRSRA DPQAEADLYL DGNLNEGLSE GQISSDAKQP LAFNLNLNTA 
AKGHSQWMLD NNIFSHTGAN GKKSGDRMRD SGYIFTGTYG SGENIAWRGT TGTPNFTTFV
EKNHENLVRS NSHRLTLMSS NLQEVGISSL QGEFTFEGIN YNTVMTTQNF AYSGTSGPFI
TGVAYTDAVK DDNFYTVGEG ISGIIVTAVN TNNSNNIFTT TTWDAGGYSL DVDPNQTYDV
TFSGDLNGDG QAGDTVTYQV KVSSENVKLD VVSDSLPTPN APPIAVNDTT NTSKGQAVTF
SITENDSDTD GTLELATVDL DPSTAGRQNT LTVANEGTYT VDNAGNLTFT PEPEFAGTTA
TITYTVEDNN GEVSNPAEIG VTVIPFDPLD QPVKFDFNGD GVADILWRRE NGPNRIWLMN
DNGTRKSTKN PGNFGAAWDV AGVGDFNADG VADIFWRHNK NRGNRVWLMN DNGTRKSTKN
PGNFGAAWDV VGVGDFNADG VDDILWRRDN QKLNRIWLMN NNGKRKQLVN PGNFGAAWDV
AGVADFNADG VDDILWRHNN GRNRISFMNN DGKLDNTVNP GGLGSTWDVA GVADFNADGV
DDILWRKKNG TNSIWLMNGD GTHDDIINPG SFGSAWDVAG VADFNADGLT DILWRHDNGA
NRIWLMDDDS TRAQNLNPGA FGSAWDIVGM
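
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how the gene sequence might be cleaned, translated, and checked against the listed protein sequence and GC content follows. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. Table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are illustrative and are not part of any database tool.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACACAAC CAACTCCTCA AGACCAATAT"
print(translate(first_blocks))  # MTQPTPQDQY
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings agree. `gc_fraction` can be compared in the same way with the GC content field.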