Gene Tery_4139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4139 
Symbol 
ID4245653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6385521 
End bp6388331 
Gene Length2811 bp 
Protein Length936 aa 
Translation table11 
GC content40% 
IMG OID638109040 
Productbifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN 
Protein accessionYP_723620 
Protein GI113477559 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.820547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATGA TAACCCCAGG CAAAGTTGCC GAACTTTTAA ATGAGCCAGG TTGCGAACAC 
AACCATCAAA AAAATAACGG AGATAAAAAA CAGAAAGGCT GCCAGCAACA AGCAGCACCC
GGAGCTGCTC AAGGGGGTTG TGCCTTTGAT GGTGCTAGTA TTGCTCTAGT TCCAATTACA
GATGCAGCTC ACCTGGTTCA CGGTCCTTTA GGTTGTTCTG GTAACTCCTG GGGAGCTCGT
GGAAGTCTTT CCTCCAGTTC CCATCTATAC AAAATGGGTT TCACCACTGA TATGGGTGAA
AATGACATTA TTATGGGCGG AGAGAAAAAA TTATTGAGGG CGATCGCTGA ATTAAAAAAA
CGTTATCAAC CTCCTGCTAT ATTTGTCTAT GCCACCTGTG TTACTGCTTT AATTGGAGAT
GACCTAGAAA CAGTTTGTAA AGTCGCTACT AAAAAATTAG AAATTCCGGT CATTCCCGTT
AACTCTCCTG GTTTCATCGG TAGCAAAAAT CTGGGTAACC GGGTTGGCGC AGAAGCATTA
TTAGACCATG TAGTAGGTAC GGCAGAACCT GAATTTACTA CACCTTATGA CATAAACCTA
ATTGGGGAAT ATAATATTGC TGGGGAAATG TGGGCAATGT TACCCCTGTT TGAAAAAGTT
GGCATTCGGG TATTGTCAAA AATCACGGGA GATGCTACTT ACAAAGAAGT TTGTTATGCC
CACCGTGCCA AACTCAACGT TATGATCTGC TCTAAAGCTA TGATCAATAT GGCACGAAAA
ATGGAGGAAG AATATGGCAT TCCTTACATT GAAGAATCAT TTTATGGTGT TGCTGATATT
AATAACTGCC TCCGGAATAT TGCTGCCAAA ATTGGGGATG CTGGCCTGCA AGAACGGACA
GAAAAACTGA TCGCTGAAGA AACAACTATT CTTGAAGAAA TATTAGAACC CTATCGCCAA
CGTCTAAAAG GTAAAAAAGT TGTACTCTAC ACAGGTGGGG TGAAAAGTTG GTCTATTATT
TCGGCAGCTA AAGATTTAGA GATGGATGTG GTTGCTACTA GCACCAAGAA AAGTACAGAA
GAGGATAAGG CTAAAATCAA AAAGTTGCTC GGTAAAGATG GTATTTTGCT AGAAAAAGGT
AATGCCGAAA TACTCTTAAA AGTAATTGCC GAGACTAAAG CTGATATGCT TATAGCAGGT
GGTCGTAACC AGTATACTGC TCTCAAAGCC CGAATTCCCT TTTTACACAT TAACCAAGAA
CGTCATCATC CCTACGCCGG ATATCATGGG ATGATAGAAA TGGCTAAAGA ATTAGATGAA
GCTCTCTATA GTCCGGTTTG GGAACAGGTG AGACAACCGG CACCTTGGTT AGAGGCATGT
CAGTTAGATG ATGTTTTGGC TGTTGAAACT TTACCAAGTT TAACTAATAT TCCACCAACA
ACAGTCAATT TTCATAAACA ATCATTATCC ACAAATCCTC TGAAACTCAG TCAACCTTTG
GGTGCGGCTT TAGCATACTT AGGAATTAAT GGGATGATGC CAATGTTCCA CGGTACTCAA
GGTTGTACTG CTTTTGCTAA GGTATTATTG GTGAATCACT TCCAGGAAGC TATTCCCTTG
TCTACTACTG CTATGAGTGA AGTGACAACT ATTTTGGGAG GGGAGGATAA CATTGATAAA
GCGCTGCTGA CTCAGCTTGA GAAGTCAAAA CCAAAGGTAA TTGGTTTGTT GACTACTGGT
TTAACTGAAA CCAGAGGGGA TGACATGGAG CGTATTCTTA AGAAATTTAG GGAAGAGCAT
CCAGAATTAG ATGGGTTCCC CATATTAAAT GTTTCTAGTC CAGATTATAA GGGTTCTGCT
CAGGACGGTT TTGCTACTAC AGTAGAATGT ATAGTAGGTT ATGATTATGG GGAGCCTATT
CCCAAAGAAA TTAAAAAACC TTTGATTACA ATTTTAGCTG GTTCTTGTCT TGCTCCTGGA
GATGTCCAGG AAATCAAGGA TATTGTAGAA GATTTTGGGT TCATTCCAAT AGTTGTACCG
GACTTATCTC AGTCTTTGGA CGGGCATTTA ATTGATGATA TTTATAGTGC TACAAGCTCA
GGTGGTACCA CAATAGAGGA TTTGCGTAAT TTACGTCACT CATCTTTCAC CTTTGCTATT
GGGGAAAGTA TGCGAAATGC AGCTATAATT TTGCAAGAAA AATTTGGTAC TCAATATCAA
GTGTTCTCTC GGTTGACTGG TTTGGGTGCT GTAGATAGTT TCATGTTAAA ATTGTCTCAG
CTAATTGTAT CTCGAATTGA TCCTCATCTT GACAAAGGCT GTGAAGTTCC TGAGAAATAC
CTACGCCAAC GCCGTCAGTT ACAGGATGCG ATGTTGGATA CTCACTTCTA TTTTGGGCAT
AAGCAAGTAT CTATTGCCCT GGAACCAGAT TTACTTTGGG CAACAAGTTG GTTTGTGCGA
GAAATGGGGG GAGATATTCA TTCTGCTGTC ACAACTACGC GATCGCCGTT ACTTGAAAAA
TTACCTACGG AAAATGTGAT AGTTGGGGAC TTGGGAGATT TAGAAGAAGT GGCAGCAGGT
TCGGATTTAC TAATTACCAA TTCTCGTGGT AAGATAATAT CTGAGAAGTT AAATATTGAT
CTCTATCGGA TGGGAATGCC AATTTACGAT CGCCTTGGTA ATGGTCAACG TTGTTCTGTT
GGTTACCGTG GTACAATGAA TTTATTATTT GATATTGGTA ATATTTTCCT AGAGCAAGAG
GAATCAAAAA TTCACACCAA TGATTATTCA TTATTAAGTA CTCAGGCATA A
 
Protein sequence
MAMITPGKVA ELLNEPGCEH NHQKNNGDKK QKGCQQQAAP GAAQGGCAFD GASIALVPIT 
DAAHLVHGPL GCSGNSWGAR GSLSSSSHLY KMGFTTDMGE NDIIMGGEKK LLRAIAELKK
RYQPPAIFVY ATCVTALIGD DLETVCKVAT KKLEIPVIPV NSPGFIGSKN LGNRVGAEAL
LDHVVGTAEP EFTTPYDINL IGEYNIAGEM WAMLPLFEKV GIRVLSKITG DATYKEVCYA
HRAKLNVMIC SKAMINMARK MEEEYGIPYI EESFYGVADI NNCLRNIAAK IGDAGLQERT
EKLIAEETTI LEEILEPYRQ RLKGKKVVLY TGGVKSWSII SAAKDLEMDV VATSTKKSTE
EDKAKIKKLL GKDGILLEKG NAEILLKVIA ETKADMLIAG GRNQYTALKA RIPFLHINQE
RHHPYAGYHG MIEMAKELDE ALYSPVWEQV RQPAPWLEAC QLDDVLAVET LPSLTNIPPT
TVNFHKQSLS TNPLKLSQPL GAALAYLGIN GMMPMFHGTQ GCTAFAKVLL VNHFQEAIPL
STTAMSEVTT ILGGEDNIDK ALLTQLEKSK PKVIGLLTTG LTETRGDDME RILKKFREEH
PELDGFPILN VSSPDYKGSA QDGFATTVEC IVGYDYGEPI PKEIKKPLIT ILAGSCLAPG
DVQEIKDIVE DFGFIPIVVP DLSQSLDGHL IDDIYSATSS GGTTIEDLRN LRHSSFTFAI
GESMRNAAII LQEKFGTQYQ VFSRLTGLGA VDSFMLKLSQ LIVSRIDPHL DKGCEVPEKY
LRQRRQLQDA MLDTHFYFGH KQVSIALEPD LLWATSWFVR EMGGDIHSAV TTTRSPLLEK
LPTENVIVGD LGDLEEVAAG SDLLITNSRG KIISEKLNID LYRMGMPIYD RLGNGQRCSV
GYRGTMNLLF DIGNIFLEQE ESKIHTNDYS LLSTQA