Gene Smed_6222 details


Gene Information

Locus tag: Smed_6222
Symbol:
ID: 5320524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 1141362
End bp: 1142789
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 59%
IMG OID: 640777827
Product: nitrogenase molybdenum-cofactor biosynthesis protein NifE
Protein accession: YP_001314759
Protein GI: 150378164
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
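
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop=True):
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the final codon is a stop codon that encodes no residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop else codons


length = gene_length_bp(1141362, 1142789)
print(length, protein_length_aa(length))
```

With the Start bp and End bp values listed above, this reproduces the reported Gene Length (1428 bp) and Protein Length (475 aa).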


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTCGC TCAGTGCTAA AAATCAGGCC TTCTTCAACG AACCTGCCTG CGAAAGAAAC 
CGCAGCAAGG ACTTTGAGGT GCGAAAGAAG GGTTGTTCGC AGCCGCCGAT GCCGGGAGCG
GCAGCCGGCG GCTGCGCATT TGACGGGGCA AAGGTGGCAC TGCAGCCGAT CACCAACGTC
GCACATCTGA TCCATGCACC GCTCGCCTGT GAGGGCAACT CCTGGGACAA TCGCGGCACG
GCTTCTTCGA GTCACATGCT GTGGCGCACA AGCTTTACCA CTGATGTTAC CGAATTAGAC
GTGGTGATGG GGCATAGCGA GCGAAAGCTC TTCAAAGCAA TCCGCGAGAT CAACGAGGCG
TATGCCCCGG CGGCGGTCTT CGTCTATGCG ACCTGCGTTA CGGCACTGAT TGGCGACGAC
ATAGACGCGG TGTGCAGGCG TGCGGCGGAA AAGTTCGGCT TGCCGGTCGT GCCGGTCAAT
GCGCCGGGCT TCGTCGGCTC AAAGAACCTG GGCAACAAAC TGGCGGGGGA AGCCTTGCTC
GATCATGTCA TCGGGACCGT AGAGCCCGAT GATGCCCGGC CTAGCGACAT CAATATCCTT
GGCGAATTCA ACCTCTCCGG TGAATTCTGG CAGGTAAGGC CGCTGTTGGA CAAGCTTGGT
GTCCGCGTCC GCGCCTGTAT TCCAGGCGAC TCGCGCTATC TCGATATTGC CACAGCGCAC
CGAGCACGCG CAGCTATGAT GGTGTGCTCA ACGGCGCTCA TCAATCTCGC ACGCAAGATG
CAGGAGCGCT GGGACATTCC CTTTTTCGAG GGCTCTTTCT ATGGCATCAC CGACACTTCG
GAAGCACTCA GGCAGATCGC TGGACTGCTT GTAAGGCAAG GGGCCGGTCC GGACCTAATC
AGTCGCACCG AAGCACTGAT TGTAGAAGAA GAGGCAAGGG CGTGGAGGAG ACTTGAGGTC
TATCGACCTC GGCTGCAAGG CAAGCGCGTG CTTCTCAATA CCGGGGGGGT GAAATCCTGG
TCCGTCGCAC ATGCGCTGAT GGAGATCGGC ATGGAAATCG TCGGTACGTC AATTAAGAAA
TCGACGGACA ATGATAAGGA GCGTCTTAAG CAGATGCTCA CGAACGATAG CCGTATGAGT
GGGGCGGCGA CGCCGCGCGA GCTTTACTCG GCTCTATCGG ATCACAAGGC TGATATCATG
CTGTCGGGCG GACGCACGCA ATTTATAGCG CTCAAGGCAA AAATGCCCTG GCTCGATATC
AACCAGGAGC GCCAGCACTC CTACGCCGGC TACCACGGCG TAGTGGAACT CGCGCGCCAG
ATCGACCTAT CAATGCACAA CCCGACCTGG GCGCAGGTGC GCGAACCGGC GCCATGGGAG
ATGGCTCCTG CGCGAGGAGA TGGAGGAGGA AGCGCTACTG TGATTTAG
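
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. A minimal sketch of how the GC content field could be recomputed from this listing is shown below; the helper names are hypothetical, and the result for the full sequence is not asserted here.

```python
def clean_sequence(text):
    """Drop spaces, line breaks and any other non-ACGT characters."""
    return "".join(ch for ch in text.upper() if ch in "ACGT")


def gc_fraction(text):
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("no bases found")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, as printed in the listing.
print(clean_sequence("ATGCCCTCGC TCAGTGCTAA"))
```

Passing the full gene sequence to `gc_fraction` and rounding to a percentage gives a value that can be compared with the GC content reported in the Gene Information block.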
 
Protein sequence
MPSLSAKNQA FFNEPACERN RSKDFEVRKK GCSQPPMPGA AAGGCAFDGA KVALQPITNV 
AHLIHAPLAC EGNSWDNRGT ASSSHMLWRT SFTTDVTELD VVMGHSERKL FKAIREINEA
YAPAAVFVYA TCVTALIGDD IDAVCRRAAE KFGLPVVPVN APGFVGSKNL GNKLAGEALL
DHVIGTVEPD DARPSDINIL GEFNLSGEFW QVRPLLDKLG VRVRACIPGD SRYLDIATAH
RARAAMMVCS TALINLARKM QERWDIPFFE GSFYGITDTS EALRQIAGLL VRQGAGPDLI
SRTEALIVEE EARAWRRLEV YRPRLQGKRV LLNTGGVKSW SVAHALMEIG MEIVGTSIKK
STDNDKERLK QMLTNDSRMS GAATPRELYS ALSDHKADIM LSGGRTQFIA LKAKMPWLDI
NQERQHSYAG YHGVVELARQ IDLSMHNPTW AQVREPAPWE MAPARGDGGG SATVI
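
The protein sequence can be derived from the gene sequence with translation table 11. The sketch below assumes the amino-acid assignments of table 11, which match the standard genetic code; table 11 also permits alternative start codons, but that does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, then third base (T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text):
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(ch for ch in text.upper() if ch in "ACGT")
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate_cds("ATGCCCTCGC TCAGTGCTAA AAATCAGGCC"))  # MPSLSAKNQA
```

The first 30 bases translate to MPSLSAKNQA, the first block of the protein sequence, and the final codons GTG ATT TAG give the terminal residues V and I followed by a stop.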