Gene Tery_3033 details


Gene Information

Locus tag: Tery_3033
Symbol:
ID: 4244917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4684202
End bp: 4685209
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 31%
IMG OID: 638108064
Product: sucraseferredoxin-like
Protein accession: YP_722657
Protein GI: 113476596
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4759] Uncharacterized protein conserved in bacteria containing thioredoxin-like domain
TIGRFAM ID:
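
The start, end, gene length, and protein length fields in this record can be cross-checked against each other. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4684202, 4685209)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1008 335"
```

Under these assumptions the computed values match the Gene Length and Protein Length fields listed in the record.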


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.262955
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00219158
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATAAGT TTTTTTGTTC TAGCGCTTGC CGAGAAGCAG ATGAAGACAT TATTGGAAGT 
GGTACGAATT ATTCAGTTTA TGTATTGATA GAATGTCCTT ACCCTTGGAA ACATAATGCT
TTCGAGTCTC GTTTTTTACC AAAAAACTTA GAGATGTTGA TGGCAAAAGT GAAAAGGGAT
AAATTGTCTC TCAGATTTTT ACTGATTACT CAAAATCAAA ATTATAGGCA AAATAACAGA
AAGATTTTAA TTTATGAAAA AAATAAATCT TCATTTATCA ATAGTTATAA AAAATATGAG
TTTGACGTAG ATCATCCTGA AAAAATAGCT CCAATTATTC AAAAATACTT AGCAGGAGAT
AATTTAGATA CTAACACTCA AAATCCTCAA ATAAGAGATC TATTAGTTTG TACTCATGGT
AGTCACGATA AGTGTTGTGC TAAATATGGT AATCCGTTTT ATGCGGAAGC TAAAAAAACT
ATTTCTGAAT TGGGTTTAAA AAATACAAGA ATTTGGAAAA CAAGTCACTT TGGTGGTCAT
AGGTTTGCAC CTACTATGAT TAGCTTTCCT GATGGTAGAT ATTATGGATT ACTTAATCGA
GAATCTTTTC AAACAATTTT GCTACAAGCC GGGAATATAA AATTATTAAG CCAAGTTTAT
CGAGGTTGGA GTATTTTACC AACTTCTATT CAAGTGTTGG AAAGAGAACT TATCTTCCGC
CACGGTTGGG AATGGTTTGA GTATAAAATT AATCTTTTAC ATCTGGATAT TAATTTCGAT
AAAACATTAG TTCAAACTCA ATTAGCTGTG CTTAAACCAG ATGGTTATCA ATATATTTGT
CAAGCTAAAT TAGTTAAAGA TGAAAGTAAA ACTATCTATA TTAAAGGATC TTGTGATGCA
TCTCACGAGT CCGAATTTAT AAAGTATGCT GTCAGTAATC TTAGCTTCAT AATTGAGAAG
AAAACTTCTG AAAAAGTGTT AATAAGTTCT CATACAAAGG TTAGTTGA
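
The GC content field in the Gene Information section can in principle be recomputed from the gene sequence above. The sketch below is one hedged way to do that in Python: it strips the whitespace used to group the sequence into blocks of ten and reports the percentage of G and C bases. The function name `gc_content` is illustrative, and how the database rounds its reported value is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for display grouping."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence only; paste the full sequence to check the whole gene.
first_row = "ATGAATAAGT TTTTTTGTTC TAGCGCTTGC CGAGAAGCAG ATGAAGACAT TATTGGAAGT"
print(f"{gc_content(first_row):.1f}%")
```

Because the example uses only the first row, its result is not expected to equal the gene-wide figure.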
 
Protein sequence
MNKFFCSSAC READEDIIGS GTNYSVYVLI ECPYPWKHNA FESRFLPKNL EMLMAKVKRD 
KLSLRFLLIT QNQNYRQNNR KILIYEKNKS SFINSYKKYE FDVDHPEKIA PIIQKYLAGD
NLDTNTQNPQ IRDLLVCTHG SHDKCCAKYG NPFYAEAKKT ISELGLKNTR IWKTSHFGGH
RFAPTMISFP DGRYYGLLNR ESFQTILLQA GNIKLLSQVY RGWSILPTSI QVLERELIFR
HGWEWFEYKI NLLHLDINFD KTLVQTQLAV LKPDGYQYIC QAKLVKDESK TIYIKGSCDA
SHESEFIKYA VSNLSFIIEK KTSEKVLISS HTKVS