Gene Tery_3241 details


Gene Information

Locus tag: Tery_3241
Symbol:
ID: 4243662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4964016
End bp: 4965701
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 40%
IMG OID: 638108238
Product: NAD(P)H-quinone oxidoreductase subunit 4
Protein accession: YP_722829
Protein GI: 113476768
COG category: [C] Energy production and conversion
COG ID: [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M)
TIGRFAM ID: [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M
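
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the record.

```python
# Values copied from the record above; helper names are illustrative only.
START_BP = 4964016
END_BP = 4965701


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1686
print(protein_length(length_bp))  # 561
```

Both results agree with the Gene Length (1686 bp) and Protein Length (561 aa) fields.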


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGAGA CTCAATTTCC CTGGCTGACC ACTATAGTCC TACTGCCACT TCTGGCAGCT 
CTACTCATTC CATTCATACC TGATCAAAAC GGAAAAACTA TGAGATGGTA TGGTCTTGGT
GTAGGTGCTA TAGACTTTGC CTTAATGTGC TACGTTTTCT GGAAGTACTA CAATGCTAGT
GACTCTGGCT TTCAACTGGT AGAAAAATAT CTTTGGCTTC CTCAAATAGG TTTAAGTTGG
GCAGTATCAG TAGATGGTCT ATCTATGCCA CTAGTACTTT TAGCTGGTTT AGTGACAACA
CTCTCAATAT TCGCAGCGTG GCAAGTAGAC TACAAACCAC GTCTTTTCTA TTTCTTGATG
CTGGTTCTGT ACTCTGCCCA GATAGGAGTA TTTGTTGCTC AAGACCTAAT GCTACTCTTC
ATTATGTGGG AACTAGAATT AGTTCCTGTA TACCTACTCA TCTCAATATG GGGAGGTAAA
AAACGTCGCT ACGCAGCAAC AAAATTCTTG CTCTACACAG CAGCGGCTTC TATATTTATT
CTAGTGGCAG CCTTGGGAAT GGCCCTCTAC GGTGAAGGCA ATACTACCTT TGATATGGTG
GAATTAGGCC TAAAAGATTA TCCACTAGCT TTTGAACTTC TGCTATATTT AGGATTACTT
ATAACTTTTG GTGTCAAGTT AGCGGTTTTC CCTCTACATA CATGGCTACC TGATGCTCAT
GGAGAAGCTT CTGCACCTGT ATCAATGATA CTTGCTGGTG TATTATTAAA AATGGGCGGA
TATGGATTAA TTCGTCTGAA CTTAGAAATG CTTTCTGATG CCCATGTTTA TTTTGCTCCT
GTTCTAGCAA TTTTGGGCGT GGTTAATATT GTTTATGGTG GCTTAAATTC TTTTGGTCAA
TCTAACATGA AACGCCGTTT GGCCTACTCT TCCGTTGCTC ACATGGGTTT TGTGTTGTTG
GGTATTGCCT CATTCACTGA CTTGGGAATA AGTGGAGCTT TGCTACAGAT GATTTCTCAT
GGTTTAATTG CTGCAGTCTT GTTCTTCCTG GCAGGTGTAA CTTATGACCG CCTCCACACT
TTGGCATTAG ATGAAATGGG TGGTCTTGGT CAAGTGATGC CAAAAATATT TGCTCTATTT
ACAATAAGTG CAATGGCATC TTTAGCTCTC CCTGGAATGA GTGGTTTTGC GAGTGAGTTA
ATGGTATTTG TGGGAGTAAC TAGCAGCGAT ATTTACAGCT CTACTTTTTG TACTGTGACA
GTATTCTTAG CAGCAGTTGG TCTGATATTG ACTCCTATAT ATTTGCTTTC TATGTTGCGT
CAAATGTTCT ACAGCACTGG TAAAGCGCCA GTTTGTTTGC TCAAGAATAC TCCTTATGAA
AATGAAGTGT TGGATGAAGC AATTTGTTTT GGTACTAACT GTGTTTTACC TGCAAAAGCT
GTTTATACTG ATGCCAAACC AAGGGAAGTA GCGATCGCTG CTTGTTTCTT AGTTTTAATT
ATCGGAATTG GTTTATATCC TAAAATAGCA ACTAGAATGT ATGATGCAAA GATAGTTGCA
GTAAACACTC AAGTACGTCA ATCATACACT TTTGCTAAAG CTGATCCTCA ACTGTTTGCT
AAAGGATTTT TATTTCCCAG AATTCCTGAG TCTGAAGTAT TATCTGTTTC TGGTCTATTA
AGATAG
 
Protein sequence
MMETQFPWLT TIVLLPLLAA LLIPFIPDQN GKTMRWYGLG VGAIDFALMC YVFWKYYNAS 
DSGFQLVEKY LWLPQIGLSW AVSVDGLSMP LVLLAGLVTT LSIFAAWQVD YKPRLFYFLM
LVLYSAQIGV FVAQDLMLLF IMWELELVPV YLLISIWGGK KRRYAATKFL LYTAAASIFI
LVAALGMALY GEGNTTFDMV ELGLKDYPLA FELLLYLGLL ITFGVKLAVF PLHTWLPDAH
GEASAPVSMI LAGVLLKMGG YGLIRLNLEM LSDAHVYFAP VLAILGVVNI VYGGLNSFGQ
SNMKRRLAYS SVAHMGFVLL GIASFTDLGI SGALLQMISH GLIAAVLFFL AGVTYDRLHT
LALDEMGGLG QVMPKIFALF TISAMASLAL PGMSGFASEL MVFVGVTSSD IYSSTFCTVT
VFLAAVGLIL TPIYLLSMLR QMFYSTGKAP VCLLKNTPYE NEVLDEAICF GTNCVLPAKA
VYTDAKPREV AIAACFLVLI IGIGLYPKIA TRMYDAKIVA VNTQVRQSYT FAKADPQLFA
KGFLFPRIPE SEVLSVSGLL R
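
The gene and protein sequences above are related through translation table 11. The following is a minimal sketch of that relationship, assuming the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG) and that the spaces are only display grouping. The helper names are illustrative. Table 11 uses the standard codon assignments for elongation; its alternative start codons are not handled here, which does not matter for this gene because it starts with ATG.

```python
# Standard codon assignments (shared by translation tables 1 and 11),
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGATGGAGA CTCAATTTCC CTGGCTGACC"))  # MMETQFPWLT
# Last 15 bases of the gene sequence (in frame), including the stop codon.
print(translate("GGTCTATTA AGATAG"))                  # GLLR
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the GC content field.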