Gene Smed_6181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6181 
Symbol 
ID5320483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1103589 
End bp1104914 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content57% 
IMG OID640777799 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifN 
Protein accessionYP_001314731 
Protein GI150378136 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCGCA TCCTTTCTCA GACTAAATGG GCAACGATCA ACCCCCTGAA ATCGTCGCAG 
CCGCTGGGTG GCGCCTTGGC CTTTCTTGGT GTCGATGGTG CGATACCGCT ATTCCATGGC
AGTCAAGGTT GCACCAGCTT TGCACTGGTG CTTCTCGTTA GGCACTTCAA GGAAGCGATT
CCCTTGCAAA CCACCGCGAT GGACGACGTG GCAATAGTCC TCGGCGGGGC GGGTCATTTG
GAGCAGGCGA TCCTCAATCT CAAAATTCGC GCAAAGCCAA AGCTGATCGG TATATGCACC
ACGGCGCTGG TGGAAACTCG TGGCGAAGAT CTGGTGGGTG ATCTCGCCAG TATCAAGCTG
GAGCGCGCGG AAGAACTCAC AGGTACCGAC GTCGTGCTGG CCAATACACC GGATTTTGAC
GGCGCTATGG AGGAGGGTTG GGCCAAGGCT GTCACAGCAA TGATCAAAGC GATTACACGA
ATCGGCGAGC AGGAGCGGCA GTCGAGAACT ATAGCAATTC TCCCTGGGTG GAATCTCACT
ATAGCTGACA TCGAGCAGTT GCGCGATATA GTAGAAAGCT TCGGGCTCAA GCCGATCATC
CTGCCGGACC TCTCTGGCTC GCTTGATGGT ATAGTGCCCG ATGGCCGCTG GGTGCCGACG
ACATACGGCG GCATCAGCGT CGAGGAGATA CGCGAGCTTG GCACAGCAGC GCAGTGCATA
GCCATTGGTG AGCATATGCG CGGTCCAGCA GAGGAGATGA AGACGCTGAC CGGAGTTCCT
TACGTGCTGT TTCAGTCGCT GACAGGATTA AATGCGGTCG ACCGGTTTGT CTCGCTACTT
TCCTCTATTT CCGGTCGGCC CGCGCCCGCG AAAGTCCGCC GGCGCCGCGC ACAGCTGCAG
GATGCCCTGC TGGACGGACA TTTCCACTCG GCTGGCAAGA AGATTGCGAT CGCAGCCGAG
CCGGACCAGC TCTATCAACT CGCTACGTTC TTCATTTGCC TGGGTGCCGA GATTGTGGCA
GCCGTTACCA CGAAAGGTGC GTCGAAAATC CTTCACAAAG TACCGGTGGA AGTAATTCAG
GTCGGCGACC TCGGCGACTT GGAAAGTCTT GCCACCCATG CTGATCTTCT CGTCACGCAT
TCGCACGGCC AGCACGCTTC AGCACGTCTC GGCACTCCGC TAATGCGCGT CGGTTTTCCT
GTCTTCGACC AACTGGGCAG TCAGCACAAG CTCACAATTC TGTATCACGG AACGCGCGAC
TTGATCTTCG AAGTTTCCAA CATCTTCCAA TCCCATTCCC TTGCGCCGAC GCACCGGGGA
ACGTGA
 
Protein sequence
MVRILSQTKW ATINPLKSSQ PLGGALAFLG VDGAIPLFHG SQGCTSFALV LLVRHFKEAI 
PLQTTAMDDV AIVLGGAGHL EQAILNLKIR AKPKLIGICT TALVETRGED LVGDLASIKL
ERAEELTGTD VVLANTPDFD GAMEEGWAKA VTAMIKAITR IGEQERQSRT IAILPGWNLT
IADIEQLRDI VESFGLKPII LPDLSGSLDG IVPDGRWVPT TYGGISVEEI RELGTAAQCI
AIGEHMRGPA EEMKTLTGVP YVLFQSLTGL NAVDRFVSLL SSISGRPAPA KVRRRRAQLQ
DALLDGHFHS AGKKIAIAAE PDQLYQLATF FICLGAEIVA AVTTKGASKI LHKVPVEVIQ
VGDLGDLESL ATHADLLVTH SHGQHASARL GTPLMRVGFP VFDQLGSQHK LTILYHGTRD
LIFEVSNIFQ SHSLAPTHRG T