Gene Tery_4653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4653 
Symbol 
ID4246307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7155056 
End bp7158055 
Gene Length3000 bp 
Protein Length999 aa 
Translation table11 
GC content36% 
IMG OID638109520 
Productsurface antigen (D15) 
Protein accessionYP_724096 
Protein GI113478035 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4775] Outer membrane protein/protective antigen OMA87 
TIGRFAM ID[TIGR00992] chloroplast envelope protein translocase, IAP75 family
[TIGR03303] outer membrane protein assembly complex, YaeT protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0482957 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCTTGT ATCCGTTTTT AGTGGCCATT TTTGCGGCTT TAACAACTTT TGGAATCTCA 
AAGTCTGCTA ATGCTCAAAT ATTTGGGAGT AGTGTAGATA CTGATTTAGT TTTTATATCA
GTTAATTCAA AAACACAGTT TGCTAAACTT TTTTCTACCA AATCTTTTTT AGATTTAGGA
TCAAGTTTCC CTTCACAAAA ATTATTTGTT AGTAGGAATA ACTTGTTGAG TAAGTCTTGG
TCGGAGGCAG AAATAAAGTT AGATTTTGAT AATTTATTAA AGATCAAAGA AATTACAAAA
TCATCTTCTA TTTATAAGAA AGAATTGGCT AATTTTTGGG ATTTGTATGT TTTGGAAAAT
CTAGAAAATC CTCAACTAAT CCTAACTAAT AAATCAACAA AAACTAAATC TGAATTTTTA
GTAAATAAAC AATTTTCTGT CAGCATTTCT AATCTGGAAA TGTTTAACAA TAGGAATCAA
AATTTTCAGA AATTGGTTTT GTCAAAATCA ACAGAAACTA AATCTGAACC TTTAATAAAT
AAACAATTTA CTGCAACTAT TCCTCAAGGA AAAATAGTTG AGAATAATAG TCAAAATTCT
CAGAATGTGG TTTTGTCAAA ATCAACAGAA ACTAAATCTG AATCTTTAGT AAATAAACAG
TTTATTGCCA ATATTCCTCA AGGAAAAATA GTTGAGAAGG AGAGTCTTGA TTCTCAGAAT
ATGGTTTTGT CAAAATCAAC AGAAACTAAA TCTGAACCTT TAGTAAATAA ACAGTTTATT
GCCAATATTC CTCAAGGAAA AATAGTTGAG AAGGAGAGTC TTGATTCTCA GAATATGGTT
TTGTCAAAAT CAACAGAAAC TAAATCTGAA CCTTTAGTAA ATAAACAGTT TATTGCCAAT
ATTCCTCAAG GAAAAATAGT TGAGAAGGAG AGTCTTGATT CTCAGAATGT GGTTTTATCA
AAATCAACAG AAACTAAATC TGAACCTTTA GTAAATAAAC AGTTTATTGC CAATATTCCT
CAAGGAAAAA TAGTTGAGAA GGAGAGTCTT GATTCTCAGA ATGTGGTTTT GTCAAAATCA
ACAGAAACTA AATCTGAACC TTTAGTAAAT AAACAGTTTA CTGCCAATAT TCCCACTACG
GAAATAGTTA CAGAATCAGA AAAATATATT TCCCCTCAAT CACAACAAGC ACAAACCTCT
ACAGAGGAAG AAGAACCCCT AGTGCTAGTA GCCGAAGTTG TAGTTACTGG AGTAGACGGA
GAACTCCAAG ATGAAGTTTA CCGAGTCATT AACACTCAAC CAGGAGAAAC AACCACTCGC
TCTCAACTAC AAAAAGATAT TAACGCTATT TTTGCTACTG GTTTCTTCCA AAATGTGAAA
GCAATACCTG AAGATACTCC TCTAGGTGTA AGAATTATTT TTAAAGTCGA ACCAAACCCT
ATATTGACTT CTGTAATAAT AGAGGGTGCA GAAGTATTTC CTGAGAGTGA AACAGAGAGA
ATATTCAATG AGCAATATGG TGAAACTTTG AACTTGAAAG TTTTTGAACA AGGTGTTGAA
CAAGTCGACC AATGGTATCA AGACAATGGT TATGTTCTTG GACAAGTTAT TGGTGCGCCC
CAAGTTGGTG ATGATGGTAC AGTTACCTTA GAAGTTGCAG AGGGAAAAAT TAAAGATATT
CAGGTACGCT TCCTAAATTC AGAAGGAGAA ACAGTAGATG AAGAAGGTAA TGTCATAAAA
GGTCGTACTC GTGAATATAT TATCACTAGA GAAATTGAAC TGAAAACAGG AGATATATTT
CAGCGACAGA CTGCAGAAAA AGACATTAGA AGAGTGTTTG ATTTAGGAAT TTTTGAAGAT
GTGAGATTGG GGTTAGAACC AGCACCAGAT GATCCTAATA CGGCAGTTAT TGTTGTGAAT
ATTGTAGAGA AAAGTACCGG TTCTCTTGCA TTTGGTGGTG GGGTTAGTTC TGCAAGTGGG
TTGTTTGGTA CAGTAAGTTA TCAACAACAA AATATTGGTG GTAATAACCA AAAATTAGGT
GGTGAATTTC AGGTTGGTGA ACGATTAGTA TTAGCAGATG TTAGTTTTAC AGATCCTTGG
ATCGGTGGAG ACGATCATCG ACTCTCTTAC ACGGTGAATG CCTTCAGACG GCGAACTATT
TCAGTGATTT TTGATTCCGA TGATGACGAT CAGCGTGATG TCGATTTGCC TAATGGTGAT
AACCCACGAG TTATTCGTAC AGGAGGTGGG GTTAGTTTTA CTCGTCCGTT TATCCCTAAT
CCATTTGTTG ATCCAGATTG GACTGCTTCT CTGGGATTAA AATATGAACG GGTGGAAATT
CAGGATCGTG ATGGTGAAGT TGAACCTAGA GATGAGTTGG GCAATAAATT GACAGTCGAT
GATTCTGGGA AAGATGATCT ATTCACTATT CAATTTGGTA TTGTCAATGA TCGACGCAAT
AATCCCCGAC AACCTACTTC TGGTAGTTTG TTGCGGTTTG GTGCAGAACA GTCTATTCCT
GTAGGCTCAG GAGAAATTGG TTTGAATCGT TTGCGGGGTA ATTTCAGTTA TTTTATTCCA
GTAAGTTTTA TTAACTTTAC TGATGGACCT CAAGCTTTAG CTTTTAATAT ACAAGCAGGC
CATATAATAG GAGACTTACC TCCTTATGAA GCTTTTGCTC TGGGTGGTAC TAATACAGTC
CGTGGATATG ATGAGGGTTC GGTGGCAGCA GGTCGTACTT TTGTCTTAGG AACTGTTGAG
TATCGCTTTC CTGTATTTAA ATTTCTTGGT GGTGCTTTAT TTGTTGATGC TGCAACAGTA
TTTGATAGTC AAAGATCTGT TATTGGTAAT CCTGGAGGTG TGCGGGAGAA GCCAGGAGAT
GGCATAGGTT ATGGTGGTGG TCTCCGGGTG AATTCTCCTC TGGGTCCGAT TAGAATTGAT
TATGCTATTA ATGATGAAGG TGACACTCGT TTCCACTTTG GTATTGGTGA GCGCTTTTAA
 
Protein sequence
MCLYPFLVAI FAALTTFGIS KSANAQIFGS SVDTDLVFIS VNSKTQFAKL FSTKSFLDLG 
SSFPSQKLFV SRNNLLSKSW SEAEIKLDFD NLLKIKEITK SSSIYKKELA NFWDLYVLEN
LENPQLILTN KSTKTKSEFL VNKQFSVSIS NLEMFNNRNQ NFQKLVLSKS TETKSEPLIN
KQFTATIPQG KIVENNSQNS QNVVLSKSTE TKSESLVNKQ FIANIPQGKI VEKESLDSQN
MVLSKSTETK SEPLVNKQFI ANIPQGKIVE KESLDSQNMV LSKSTETKSE PLVNKQFIAN
IPQGKIVEKE SLDSQNVVLS KSTETKSEPL VNKQFIANIP QGKIVEKESL DSQNVVLSKS
TETKSEPLVN KQFTANIPTT EIVTESEKYI SPQSQQAQTS TEEEEPLVLV AEVVVTGVDG
ELQDEVYRVI NTQPGETTTR SQLQKDINAI FATGFFQNVK AIPEDTPLGV RIIFKVEPNP
ILTSVIIEGA EVFPESETER IFNEQYGETL NLKVFEQGVE QVDQWYQDNG YVLGQVIGAP
QVGDDGTVTL EVAEGKIKDI QVRFLNSEGE TVDEEGNVIK GRTREYIITR EIELKTGDIF
QRQTAEKDIR RVFDLGIFED VRLGLEPAPD DPNTAVIVVN IVEKSTGSLA FGGGVSSASG
LFGTVSYQQQ NIGGNNQKLG GEFQVGERLV LADVSFTDPW IGGDDHRLSY TVNAFRRRTI
SVIFDSDDDD QRDVDLPNGD NPRVIRTGGG VSFTRPFIPN PFVDPDWTAS LGLKYERVEI
QDRDGEVEPR DELGNKLTVD DSGKDDLFTI QFGIVNDRRN NPRQPTSGSL LRFGAEQSIP
VGSGEIGLNR LRGNFSYFIP VSFINFTDGP QALAFNIQAG HIIGDLPPYE AFALGGTNTV
RGYDEGSVAA GRTFVLGTVE YRFPVFKFLG GALFVDAATV FDSQRSVIGN PGGVREKPGD
GIGYGGGLRV NSPLGPIRID YAINDEGDTR FHFGIGERF