Gene Tery_4466 details


Gene Information

Locus tag: Tery_4466
Symbol:
ID: 4246119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6889357
End bp: 6890832
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 40%
IMG OID: 638109349
Product: carbohydrate-selective porin OprB
Protein accession: YP_723926
Protein GI: 113477865
COG category:
COG ID:
TIGRFAM ID:
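
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 6889357
end_bp = 6890832
gene_length_bp = 1476
protein_length_aa = 491


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one terminal stop codon (assumption: unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the record is internally consistent: 6890832 - 6889357 + 1 = 1476 bp, and 1476 / 3 - 1 = 491 aa.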


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.666985
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACAAG TAACATCTGT ATCTCAGTTA TCAGATGTAC GACCTACTGA TTGGGCTTTC 
CAAGCTTTGC AATCTCTAGT AGAACGTTAC GGTTGTATTG CTGGTTATCC AGACGGTACT
TATAAAGGTA ATCGGGCAAT GACTCGTTAT GAGTTTGCAG CAGGTTTAAA TGCCTGTTTA
GAGAGAGTTA CAGAGTTAAT TGCTCTTGCA ACTGCAGATT TAGTAACTCG TGATGATTTG
GCTGTTTTAC AACGACTTCA AGAAGAGTTT GCTGTAGAAT TAGCAGAGTT GCGTGGTCGT
GTTGATGCCC TTGAAGCTAG AACAGCAGAA CTTGAAGCGA ATCAATTCTC TACAACTACT
AAGCTCAATG GTGAAGTTCT ATTTTGGGTG ACTGATACTT GGGGAGAACG AGCAGAAGCT
CGTGGTGAAC CACAAAGTGA AAACGATAGA ACTGAAGCTG CATTAGGTTA TCGGGTTCGT
TTGAACTTTG ACACTAGCTT TATGGGTAAA GACCGCCTCA GAGCTCGTTT ACAGGCCAGA
GATATTCCTA ACTGGAGTGC CCGGGATTTA ACTAATACTC TTATGACCAG ATTGGGTACA
GATGAAAGTG ACCCTGACGA TACAGTGGTT CTTGATAAAT TGTTTTATCA GTTTCCTGTT
GGTGATCAAT TACAGGTAAT TATTGGCCCT CAAGGGGTTG AAGTCGATGA CTTTCAGACT
GTCTTATCTC CCTTTGAAAG TAGTGGTTCT GGCGCTACTT CAAGATTTGG ACGATATAAC
CCTACTGCCT ATCGTGGACC TGATGATGGA GGACTAATTG TTCAATACAA ACCTGCTAAA
CAATGGCAAA TTAATGCTGG TTATTTAGCT GGAGAGCCAG AAAATCCTCG AGAAGGAAAT
GGTTTATTTA ATGGTGAACA TAGTGCGTTT GGTCAAGTTG CATTTGAACC TAATTCCAAG
CTAGCTTTTA CAGTTAACTA TGTTCGTAAG TATTTTATTA AGGATGAGGT TAATGTTACC
TCTAGTACAG GAAGTTTCCG AGCACGAGAC CCTTTTGATG GCAGACGAAC TACGGCTGAT
AATATTGGAC TTGAAGCTCA GTGGAAGCTT AACGATCATG TCCAAATTGG GGGTTGGTTT
GGTACTACCT GGGCGCGTCC TGAAGATGGT AATAATGATG ATGATGATGA CATCACTATT
ATTAATGGCG CACTTACAAT TGCTTTTCCT GACTTATTTA AAGACGGTAG CTTAGGAGGA
ATTATTGTTG GTGTACCACC AATTATTACA GATGGTGGTA ATGATGATAA TTTGAAAGAC
CCTGATACTT CTGTCCATGT TGAAATTTTC TATCGTTATG CGATTAATGA CTTTATAGCA
ATTACACCTG GTTTGTTTGT GATTACTAAT CCCAATCACG ATGAGGATAA TGAAACTCTT
TGGGTAGGTA GTTTGAGAAC CACCTTTAAG TTCTAG
 
Protein sequence
MGQVTSVSQL SDVRPTDWAF QALQSLVERY GCIAGYPDGT YKGNRAMTRY EFAAGLNACL 
ERVTELIALA TADLVTRDDL AVLQRLQEEF AVELAELRGR VDALEARTAE LEANQFSTTT
KLNGEVLFWV TDTWGERAEA RGEPQSENDR TEAALGYRVR LNFDTSFMGK DRLRARLQAR
DIPNWSARDL TNTLMTRLGT DESDPDDTVV LDKLFYQFPV GDQLQVIIGP QGVEVDDFQT
VLSPFESSGS GATSRFGRYN PTAYRGPDDG GLIVQYKPAK QWQINAGYLA GEPENPREGN
GLFNGEHSAF GQVAFEPNSK LAFTVNYVRK YFIKDEVNVT SSTGSFRARD PFDGRRTTAD
NIGLEAQWKL NDHVQIGGWF GTTWARPEDG NNDDDDDITI INGALTIAFP DLFKDGSLGG
IIVGVPPIIT DGGNDDNLKD PDTSVHVEIF YRYAINDFIA ITPGLFVITN PNHDEDNETL
WVGSLRTTFK F
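
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is an illustrative sketch only, not the tool that generated the record. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in its permitted start codons; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGGGACAAG TAACATCTGT A"))  # MGQVTSV
# Last line of the gene sequence above, ending in the TAG stop codon.
print(translate("TGGGTAGGTA GTTTGAGAAC CACCTTTAAG TTCTAG"))  # WVGSLRTTFKF
```

Both fragments match the start (MGQVTSV) and end (WVGSLRTTFK F) of the listed protein sequence; the space-separated blocks of ten characters are removed before translation.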