Gene Smed_6233 details

Gene Information

Locus tag: Smed_6233
Symbol:
ID: 5320535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 1154850
End bp: 1156322
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 57%
IMG OID: 640777837
Product: nitrogenase cofactor biosynthesis protein NifB
Protein accession: YP_001314769
Protein GI: 150378174
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR01290] nitrogenase cofactor biosynthesis protein NifB
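
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are inclusive and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1154850
end_bp = 1156322
gene_length_bp = 1473
protein_length_aa = 490

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = span // 3

print(span == gene_length_bp)           # True
print(span % 3 == 0)                    # True
print(codons - 1 == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1473 bp is 491 codons, that is, 490 amino acids plus one stop codon.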


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTACAC CCATGATTTT GCGTGAGAGC CGGACCAGCA CTACATTCTC TGACCAGTTG 
CTGGAGAACG CTAAATCGCT TGGCTGCTCA CCTCCATCGA CGGCGCTGGG CGACATAGAT
CCTGGAACTT GGGACAAGAT TAAGAATCAC CCCTGTTTTT CAGAGGAGGC GCATCACTAT
TTCGCGCGCA TGCACGTGGC GGTCGCGCCT GCCTGCAACA TCCAGTGTAA CTACTGCAAT
CGCAAATACG ATTGCGCCAA CGAAAGTCGG CCCGGTGTTG CCTCGGAAAA GCTCACACCT
GACCAGGCGG TGCGAAAGGT GATTGCCGTT GCCAACGAAG TGCCTCAGCT GTCAGTGCTC
GGCATCGCTG GGCCTGGCGA TGCTTGTTAC GACTGGAAGA AAACAAGGGC GACGTTCGAA
CGAGTGGCTA GGGAAATTCC GGACATAAGA CTATGCATCT CCACGAACGG GCTCTCGCTG
CCGGACCATG TCGATGAGCT TGCCGAAATG AACGTCGATC ACGTGACGAT CACCATCAAC
ATGGTCGATC CGCGTGTCGG CGTAAAGATC TATCCCTGGA TTTACTATGG TCAGCGCCGC
CACACTGGTA TCGACGCTGC GAGAATCCTG CACGAACGGC AGATGTTGGG CCTGGAGATG
CTGGCCGAAC GCGGCATCCT CACCAAGGTC AACTCGGTAA TGATCCCCGG CGTCAATGAT
GAGCACCTGA TCGAAGTCAA CAAAGTTGTG AAAGGCCGAG GCGCGTTGCT GCACAACGTA
ATGCCGCTAA TTTCAAACCG AGTACACGGG ACCTATTACG GACTGACAGG GCAGCGCGGC
CCGGAGGCCT TCGAACTGCA GGCCCTTCAG GACCGTCTAG AAGGAACCAA ACTGATGCGT
CATTGTCGAC AGTGCCGGGC CGATGCCATA GGCTTGCTCG GCGATGATCG TGGTCACGAG
TTCACGCTCG CTGAGATCCC CGACGGGATA ACCTACGATG CCAGCAAGCG ACAGGCCTAT
CGCCAGTTGG TCGCGCGCGA ACGCGGGGAT CACCTAGTGG CCAAGAACGA GGCGATCAGA
ACGGTAATGT CGGTGGAGTA TGGCGGATCG CTTCTCATTG CCGTGGCGAC CAAAGGCGGG
GGCCGGATCA ACGAACATTT TGGACACGCG AAAGAATTTC ACGTTTACAC CGTCTCTCGG
AGAGGGATCA AGCTGGCAGG CCGCCGCAGG GTTGAGCAGT ATTGCCTCGG CGGCTGGGGC
GAGGACGCCA CCCTCGACCA CATCGTCAAT GCGCTTGAAG GAATAGACAT TCTGCTCTGC
GTCAAGATCG GAGATTACCC AAGGAAACAG CTGACACAGG CCGGGCTTCG AGCGACGGAA
GCTTACGGCC ATGACTACAT CGAGAGTGCG CTCGGCGCGC TCTACGCCGC CGAGTTTGGC
ATCGAACCAC CGGTAAAGAC GGCGACAGCT TGA
 
Protein sequence
MFTPMILRES RTSTTFSDQL LENAKSLGCS PPSTALGDID PGTWDKIKNH PCFSEEAHHY 
FARMHVAVAP ACNIQCNYCN RKYDCANESR PGVASEKLTP DQAVRKVIAV ANEVPQLSVL
GIAGPGDACY DWKKTRATFE RVAREIPDIR LCISTNGLSL PDHVDELAEM NVDHVTITIN
MVDPRVGVKI YPWIYYGQRR HTGIDAARIL HERQMLGLEM LAERGILTKV NSVMIPGVND
EHLIEVNKVV KGRGALLHNV MPLISNRVHG TYYGLTGQRG PEAFELQALQ DRLEGTKLMR
HCRQCRADAI GLLGDDRGHE FTLAEIPDGI TYDASKRQAY RQLVARERGD HLVAKNEAIR
TVMSVEYGGS LLIAVATKGG GRINEHFGHA KEFHVYTVSR RGIKLAGRRR VEQYCLGGWG
EDATLDHIVN ALEGIDILLC VKIGDYPRKQ LTQAGLRATE AYGHDYIESA LGALYAAEFG
IEPPVKTATA
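
The gene and protein sequences above are printed in blocks of ten characters. The sketch below, which is not part of the original record, shows one way to strip that grouping and translate the DNA. It builds the standard codon table by hand. NCBI translation table 11 assigns the same amino acids to codons as the standard table and differs only in the permitted start codons, which this sketch does not model. The function names (`clean_sequence`, `translate`, `gc_fraction`) are hypothetical helpers, not from any library.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean_sequence(text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of G and C bases in an ungrouped DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTTTACAC CCATGATTTT GCGTGAGAGC CGGACCAGCA CTACATTCTC TGACCAGTTG"
last_line = "ATCGAACCAC CGGTAAAGAC GGCGACAGCT TGA"

print(translate(clean_sequence(first_line)))  # MFTPMILRESRTSTTFSDQL
print(translate(clean_sequence(last_line)))   # IEPPVKTATA
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.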