Gene Tery_4465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4465 
Symbol 
ID4246118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6886447 
End bp6888165 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content41% 
IMG OID638109348 
Productcarbohydrate-selective porin OprB 
Protein accessionYP_723925 
Protein GI113477864 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00227345 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAAAC TTATATGGAA TACTTTTAAG CACAGTCCTA GTGTTTTTAG TATAGCATTA 
TTGATGGCAG GCTCAGCAAT CGCTGCTGAG ACTCCTCTAC AAAATTTAGG AACTGATGAA
AGTCCTGTAA ATCAGAACTT AACCCAAGGC AGCATTGAAA TTGCTCAAAA TTTTGATACT
CGGTTAATGC CAATGGATGA CCCTTCACTT ACACCAGTAG GGGTTTCAGA CTTAGAAAAT
GGCGAGTACA TGGACCAGGT AACATCTGTA ACTCAGTTAT CAGATGTACG ACCTACTGAC
TGGGCTTTCC AAGCTCTACA ATCCTTGGTA GAGCGTTATG GTTGTATAGC AGGTTATCCT
GACGGTACTT ATAAAGGAAA TCGAGCGATG ACTCGCTTTG AGTTTGCAGC CGGTTTAAAT
GCCTGCTTGG ATAGAGTCAC AGAATTAATT GCTGCTGCAA CTTCAGACCT AGTAACTAGA
GAAGACTTGG CAGTTTTACA AAGACTACAA GAAGAGTTCA GCGCAGAACT AGCTGCTTTG
CGGGGACGAG TTGATTCTTT GGAGGCCAGA ACATCAGAAT TAGAGGCCAA TCAATTCTCT
ACTACAACAA AATTAAACGG TGAGGTATTG TTCTGGTTAA GTGATACCTG GGGAGAAAGA
GCCGCCGGTC GTGGGACAAA AAAGAGTGAG GAGGACAAAA CTGAGACAAC CTTTGCTTAT
CGAGTTCGTT TAATCTTTGA TAGTAGTTTT ACTGGGAAAG ACCGCTTGAG AACTCGTTTA
CAAGCTCGTA ACGTTCCAAA ATACGACAGT CGAGATTTGA CGAACACTAT GATGACTCGT
CTAGGTACTG ACGATGATTT TGATGACGAT TTTGTTCTCA ATAAATTGGC CTATCGTTTT
CCTTTACTAA ATGGTAGAGG TCAAATAGAA CTAGCAGCTA ATGGTTATGG TCTGGACGAC
TTCATGGGAC CCATTACACC TTTAGATAGT AGTGGTTCTG GTTCTATTTC AAGATTTGGG
CGATTTAACC CTACTTTTTA TCGTGCTCCA GCTGATGCTG GTGTTAAATT TGCTTATGCC
TTCAACGACG CAATTAAGTT GACAGTTGGT TATGCTGCAC CAGATCCAGA AGACCCTCAA
GAAGGTAAAG GTATTTTTAA TGGTGGCTTC AGTGGATTTG GCCAAGTTAC CTTTGAGCCA
AATGATAGGA TAGTTTTTCA AGCAGGTTAT GTGCGTGGCT TCCATCCTAA AGGAGATGTG
AACTTAACTG GAAGTACAGG TAGCTTTAAA GCCAAAGATC CTTTTAAGGG TATACGGACT
AGTGCAGATA ATATCAACTT TGAAGCCCAG TGGTTAATTA CTGAAGGTTT CCAAATAGGT
GGTTGGTTTG GTGCCTCTTT TGCCCGTCCT GAAGATAACA ATGATACAGA TGATATCACT
ATTGTTAATG GTGCTTTGAC TTTAGCTTTC CCAGACCTAC TCAAAGAAGG TAGCTTAGGA
GGTATTATCA TTGGTGTACC ACCAATTATT ACTGATGGTG GTGATGATGA TTCTCTGAAA
GACGATGATA CTTCTATTCA CGTTGAGGTA CTTTATCGCT TCCAAATGAA TGACAATATT
GCCATTACTC CTGGTGTATT TGTGATTACT AACCCTAATC ACATTGAGGA TAATGAAACT
CTCTGGGTTG GTACAATAAG AACTCAATTC AGATTCTAG
 
Protein sequence
MIKLIWNTFK HSPSVFSIAL LMAGSAIAAE TPLQNLGTDE SPVNQNLTQG SIEIAQNFDT 
RLMPMDDPSL TPVGVSDLEN GEYMDQVTSV TQLSDVRPTD WAFQALQSLV ERYGCIAGYP
DGTYKGNRAM TRFEFAAGLN ACLDRVTELI AAATSDLVTR EDLAVLQRLQ EEFSAELAAL
RGRVDSLEAR TSELEANQFS TTTKLNGEVL FWLSDTWGER AAGRGTKKSE EDKTETTFAY
RVRLIFDSSF TGKDRLRTRL QARNVPKYDS RDLTNTMMTR LGTDDDFDDD FVLNKLAYRF
PLLNGRGQIE LAANGYGLDD FMGPITPLDS SGSGSISRFG RFNPTFYRAP ADAGVKFAYA
FNDAIKLTVG YAAPDPEDPQ EGKGIFNGGF SGFGQVTFEP NDRIVFQAGY VRGFHPKGDV
NLTGSTGSFK AKDPFKGIRT SADNINFEAQ WLITEGFQIG GWFGASFARP EDNNDTDDIT
IVNGALTLAF PDLLKEGSLG GIIIGVPPII TDGGDDDSLK DDDTSIHVEV LYRFQMNDNI
AITPGVFVIT NPNHIEDNET LWVGTIRTQF RF