Gene Tery_1234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1234 
Symbol 
ID4242161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1911568 
End bp1913487 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content38% 
IMG OID638106446 
Productcarbohydrate-selective porin OprB 
Protein accessionYP_721057 
Protein GI113474996 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.373435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAAA CATTCTGGTA TTGTCTAATA CTTAGTCCAG CAGTTTTAGG AGCAACTCTT 
GTTGCATCCT CATACGCTCT GGCGACTGAA AACAAGCAAA CTAATACAGT TATTCAGCCT
CAAGCTGCAA CAACTGAAAA TATATCAATA AATAATAATA TAGATATACA GGAACCTGAT
CTATTAGCTC AAGTTACTGC AACCAATTCG ACTAATTCTG TTGGGAATAC ACCTGCAACT
ACCATACGAC AAATTAACCG CTATGGTAGA GAAGGTCGTA ATCTTCGTCG ACCAGGTCTT
CGTAATATTA CGAACAATAG GAAACCAATA AAAGGACAAG TTACTTCTGT ATCTCAACTT
TCTGATGTTC AGCCTACAGA TTGGGCTTTT CAAGCATTAC AGTCTTTGGT TGAGCGTTAT
GGATGTATCG TTGGATATCC AGATGGTACT TACAGAGGCA ACCGTGCCTT AACTCGTTTT
GAATTTGCTG CTGGTGTTAA TGCTTGCCTA GATAGAGTTA TGGAACTTTT ACAAGCAGCT
ATAGAAGATA CCGTAAGTAG AGAAGACCTG GCTATTCTGC AAAGGTTACA GGAGGAATTT
TCTGCAGAAT TAGCGATTCT CCGGGGTCGT GTTGATGCTC TAGAAGCTCG TGCTGCTGAA
TTAGAGGCTA ATCAATTTTC TACAACTACT AAACTTAGGG GTGAGGTAAT TTTTGCCCTT
GTTGATAACT TTGAAGACCA AGGAAAATAT GGAGCATTTG GTGATACTAT CCAGGATGAG
GATCAAACGG TATTCCAGTA TAGGGTACGT CTGAATTTCA ATACTAGCTT TTCTGGTGAA
GACTTATTAT TTACCAGGTT ACAAGCGGGT AATGCTCAAG CATTTAATCA AAAAATAGCT
GAGTCTGAGG GACAGCAAAC CTTTAATATT CTTGACCGTA CAGATGATAC GCTTCAACTA
GATAAGCTTT TCTACAAGTT CCCTGTCAGA GATAATATTA GAGTAACTAT TGCTGCTAAT
AAGGTAAGTT GGTACGACTT TGTTCCTACA CTTAATCCTT ATATGGAAGA TTTTGATGGA
GGCAGCGGTT CTCTAAGTGC TTTTGGTCAA AGAAACCCAA TCTATAGGTT AGGGGGTGGT
AAAGGACTTG GAGTTGAGTA TGATTTTGGT TGTAAAGACC AATATGCTTG TACCTATAGT
CCATTTTCTG TATCTTTTGG ATACTTAGCG GCAGAAGGTG AAAATCCTAG TCAAGGAAAA
GGTCTGTTTA ATGGAGACTA TGCTGCACTA GCGCAACTAA CCTTTACTCC CATCAGAAAT
TTCCAAGTTG GACTTACCTA CAATAAAGGT TATTTTGGTC CAGGGAACTT TGGATTTGAT
GATGGTGCAG CCACAGGTCT AGGTGATAAT GGTAGTTTAA ACAGTGGATT TGTAGGAACT
GGTATTGCTA ACAGTGTTTA TGGTTTAAAT GCTGGACTTA ATGATCGTCG ACCAGGTTCT
GTCAATATTA ATAAGTCAGT TTCGACTAAT GCTTATGGTG TAGAAATGTC TTGGCGAGCA
GCTGATTGGG TAACTATTAA TGGTTTTGGT ACATATCTTG ATGGGAAACT AATTGGCAAA
GGTGATTTTG AAATTTGGAC TTATGGAGCT ACTATTGCTT TCCCTGATTT ATTGAAGGAA
GGTAGTCTGG CTGGTATTGT TGTAGGTCGG GAACCTTATT TGAATGACTT AGAACTTCCT
AGTGGTTTAG ATTCAGGATT ACGACAAGAT ATGTCATGGC ATCTTGAAGC ATTCTACAAA
TATCAACTTA CAGACTACAT TTCAATTACT CCTGGTGTGA CTGTAATTAC AAATGCTAAC
CAAGATGATG ATAACGATGA GTTAATTATT GGTACTATCA GAACTACTTT TCAGTTCTAG
 
Protein sequence
MSKTFWYCLI LSPAVLGATL VASSYALATE NKQTNTVIQP QAATTENISI NNNIDIQEPD 
LLAQVTATNS TNSVGNTPAT TIRQINRYGR EGRNLRRPGL RNITNNRKPI KGQVTSVSQL
SDVQPTDWAF QALQSLVERY GCIVGYPDGT YRGNRALTRF EFAAGVNACL DRVMELLQAA
IEDTVSREDL AILQRLQEEF SAELAILRGR VDALEARAAE LEANQFSTTT KLRGEVIFAL
VDNFEDQGKY GAFGDTIQDE DQTVFQYRVR LNFNTSFSGE DLLFTRLQAG NAQAFNQKIA
ESEGQQTFNI LDRTDDTLQL DKLFYKFPVR DNIRVTIAAN KVSWYDFVPT LNPYMEDFDG
GSGSLSAFGQ RNPIYRLGGG KGLGVEYDFG CKDQYACTYS PFSVSFGYLA AEGENPSQGK
GLFNGDYAAL AQLTFTPIRN FQVGLTYNKG YFGPGNFGFD DGAATGLGDN GSLNSGFVGT
GIANSVYGLN AGLNDRRPGS VNINKSVSTN AYGVEMSWRA ADWVTINGFG TYLDGKLIGK
GDFEIWTYGA TIAFPDLLKE GSLAGIVVGR EPYLNDLELP SGLDSGLRQD MSWHLEAFYK
YQLTDYISIT PGVTVITNAN QDDDNDELII GTIRTTFQF