Gene Tery_4137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4137 
Symbol 
ID4245651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6381887 
End bp6383344 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content41% 
IMG OID638109038 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_723618 
Protein GI113477557 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.303688 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCTG AAAAGATTGA ACAGAATAAA CAGTTAATTC AAGAAGTCTT AGACGCTTAT 
CCCGCGAAAG CTGCTAAGAG ACGGAAAAAG CACCTTAACG TAATCGAAGA AAAAGGAGCT
GACTGTGGCG TTAAGTCTAA CGTAAAATCA GTTCCTGGTG TAATGACAAC TCGTGGTTGT
GCATTTGCTG GAGCGAAAGG TGTGGTTTGG GGTCCTGTTA AGGACATGGT TCACATTAGT
CACGGTCCTG TTGGTTGCGG TTACTACTCT TGGGCAGGTC GTCGTAACTA CTATAACGGT
GTAACTGGTG TTGATACTTT CGGTACAATG CAATTCACCT CAGATTTCCA AGAGAGAGAT
ATTGTTTTTG GTGGAGACAA AAAGCTCGCC AAAATTATGA ACGAAATTGA AGAGTTATTC
CCTCTGAATG CTGGTATCAC AATTGAATCT GAATGTCCAG TAGGTCTAAT TGGTGATGAC
ATTGAAGCGG TAGCGAAAAA AGCTAGCAAA GAACTCAATA AGCCAGTTGT ACCAGTACGT
TGCGAAGGTT TCCGTGGTGT TTCTCAGTCA TTAGGTCACC ACATTGCTAA CGACACAGTG
CGTGACTGGG TATACGAACC TTCTGCTAAA GTTACTAACG AAGAAATTGG TTTTGAGAAG
ACTCCTTATG ACGTATCCTT AATTGCTGAT TACAACATCG GTGGTGACGG TTGGAGTTCT
CGTTTGTTAT TAGATGAAAT TGGCTTAAGA GTTGTTAGCC AAGCAACAGG TGACGGTACT
TATAACGAAG TATTCATGGC TCCTAGGGTG AACTTAAACC TCATCCACTG CTATCGTTCT
ATGAACTATA TCTGCCGTTA CATGGAAGAA GAGTATGGTA TACCTTGGGT TGAGTTCAAC
TTCTTCGGTC CTAGTCAAAT TGCTAAGTCT CTCCGGAAGA TTGCTTCTTT CTTTGATGAC
AAAATCAAGG AAAACACAGA AAAAGTAATT GCTAGATATC AAGAACAAGC TGATGCAGTA
ATTGCTAAGT ATCGTCCTCG TTTAGAAGGC AAGAAAGTAA TGATGATGGT TGGTGGTCTC
CGTCCACGTC ACATTATTCC TGCTTTTGAC GATTTAGGAA TGGAAGTTAT TGGTACTGGT
TATGAATTTG GTCACGGTGA CGACTACAAG CGTACTGCTG ACTATGCTCA AGAAGGTACT
CTAATCTATG ATGACGTTAG TGGCTACGAA TTTGAAGAAT TTGCTAAGAA ATTAAAGCCA
GATTTAATTG CTTCTGGTAT TAAAGAGAAG TATGTTTTCC AGAAGATGGG TATGCCATTC
CGTCAAATGC ACTCTTGGGA TTATTCTGGT CCTTATCACG GTTATGACGG ATTCGCTATC
TTCGCTCGTG ACATGGATCT AGCTCTCAAT AGCCCAACTT GGAACTTAAT CAAAGCTCCT
TGGAAGCAAG CTAAGTAG
 
Protein sequence
MASEKIEQNK QLIQEVLDAY PAKAAKRRKK HLNVIEEKGA DCGVKSNVKS VPGVMTTRGC 
AFAGAKGVVW GPVKDMVHIS HGPVGCGYYS WAGRRNYYNG VTGVDTFGTM QFTSDFQERD
IVFGGDKKLA KIMNEIEELF PLNAGITIES ECPVGLIGDD IEAVAKKASK ELNKPVVPVR
CEGFRGVSQS LGHHIANDTV RDWVYEPSAK VTNEEIGFEK TPYDVSLIAD YNIGGDGWSS
RLLLDEIGLR VVSQATGDGT YNEVFMAPRV NLNLIHCYRS MNYICRYMEE EYGIPWVEFN
FFGPSQIAKS LRKIASFFDD KIKENTEKVI ARYQEQADAV IAKYRPRLEG KKVMMMVGGL
RPRHIIPAFD DLGMEVIGTG YEFGHGDDYK RTADYAQEGT LIYDDVSGYE FEEFAKKLKP
DLIASGIKEK YVFQKMGMPF RQMHSWDYSG PYHGYDGFAI FARDMDLALN SPTWNLIKAP
WKQAK