Gene Tery_4138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4138 
Symbol 
ID4245652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6383569 
End bp6385107 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content41% 
IMG OID638109039 
Productnitrogenase molybdenum-iron protein beta chain 
Protein accessionYP_723619 
Protein GI113477558 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.453924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGA ATGTAGACAA AATTAAAGAT CACTTTCAAC TTTTCCAAGA GCCAGAATAC 
CAAGAAATGT TCGCTCGGAA AAGAGAATTT GAAGGCGGTG CTTCCAAAGA AGAAATAGAA
AGAGTTCGTG AGTGGACAAA AAGTTGGGAA TATCGTGAGA AGAACTTTGC TCGTGAGGCT
CTAACTATCA ACCCTGCTAA AGCTTGTCAG CCTTTAGGTG CAATATTTGC AGCTGCAGGT
TTTGAAGGAA CTCTTCCTTT TGTACATGGT TCTCAAGGAT GTGTTGCTTA CTTCCGTTCT
CACTTAACTC GTAACTACAA AGAACCATTC CAAGCGGTTT CCTCTTCTAT GACTGAAGAT
GCTGCTGTAT TTGGTGGTCT GAAAAATATG ATTGATGGTT TGGCAAACTC TTATGCTTTG
TACAAGCCTA AAATGATTGC TCTTTGCACC ACTTGTATGG CAGAGGTAAT TGGAGATGAC
TTGGGTTCAT TCATTACCAA CTCCAAAAAT GAAGGTGCAG TACCTCAAGA TTTCCCAGTT
CCTTTTGCTC ACACTCCTAG CTTTGTTGGT TCTCATATCA CAGGCTATGA CAATATGCTC
AAGGGTATCC TAATAGCTCT TACTGACGGT AAGAAGACAG AAACTGATAA TGGAAAAATC
AACTTTATCC CTGGTTTCGA CCCTTACATT GGCAACATCC GGGATTTAAA GAATATTCTG
TCTTTAATGG ATGTTCCTAG CACTGTTTTA GCTGACAACG CTGAGAGTTT TGATTCTCCT
AACTTGGGTG AATTCAAGAT GTACAATGGT GGTACAACTC TAGAAGAAGC GGGTGATTCC
ATCAATGCTA AAGCTACTAT TTCCTTCCAA AAATACAGCA CTCCTAAGAC TCTAGAGTAC
CTGAAACAAG AAGGTGGTCA AAAAACAGCT ACATACCGCC CTATTGGTGT TCGTGGTACA
GATGAGTTCT TAATGGCTTT GTCTGAATTG ACTGGTAAGG CTATTCCTGA AGAGTTAGAA
ATTGAGCGTG GTCGTGTAGT TGATGCTATC ACTGACTCTC AAGCTTGGTT GCACGGTAAG
CGTATTGCTA TCTACGGTGA TCCTGACCAT GTATTGGGCT TGTTGAATTT CACTCTAGAA
TTAGGTATGC AACCAGTTCA CGTTGTTGTA AATAACGGTA ACGTTGCTGG TTTTGAAGAA
GAAGCTAAGG AATTGTTAGC TAATGATCCT AATGGCAAAG AAGCTACAGT TTGGATCGGT
AAGGACTTAT GGCACTTACG TTCATTGTTG GATACTGAGC CAGTTGATTT GTTAATTGGT
AACTCATACG GTAAGTTCCT ACAACGTGAC ACTGGTACTC CATTAGTACG TATTGGCTAT
CCTATTTTCG ACCGCCATCA CCAACACCGT TATTCTATCT TAGGATATAA GGGAGCATTC
AACCTCATCA ACTGGATCGT TAATACTATC CTTGATGAAT TAGACCGTGG TAGCATGGAT
CTAGGTGTTA ACGATACATC TTTTGACTTG GTTCGTTAA
 
Protein sequence
MSQNVDKIKD HFQLFQEPEY QEMFARKREF EGGASKEEIE RVREWTKSWE YREKNFAREA 
LTINPAKACQ PLGAIFAAAG FEGTLPFVHG SQGCVAYFRS HLTRNYKEPF QAVSSSMTED
AAVFGGLKNM IDGLANSYAL YKPKMIALCT TCMAEVIGDD LGSFITNSKN EGAVPQDFPV
PFAHTPSFVG SHITGYDNML KGILIALTDG KKTETDNGKI NFIPGFDPYI GNIRDLKNIL
SLMDVPSTVL ADNAESFDSP NLGEFKMYNG GTTLEEAGDS INAKATISFQ KYSTPKTLEY
LKQEGGQKTA TYRPIGVRGT DEFLMALSEL TGKAIPEELE IERGRVVDAI TDSQAWLHGK
RIAIYGDPDH VLGLLNFTLE LGMQPVHVVV NNGNVAGFEE EAKELLANDP NGKEATVWIG
KDLWHLRSLL DTEPVDLLIG NSYGKFLQRD TGTPLVRIGY PIFDRHHQHR YSILGYKGAF
NLINWIVNTI LDELDRGSMD LGVNDTSFDL VR