Gene Smed_6187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6187 
Symbol 
ID5320489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1109669 
End bp1110736 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content58% 
IMG OID640777805 
Productnodulation factor exporter subunit NodI 
Protein accessionYP_001314737 
Protein GI150378142 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID[TIGR01288] ATP-binding ABC transporter family nodulation protein NodI 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGAA ACGGGCGAGT TTTGAGACAC GAAGCGGAAA ATCAATTGTC AGATCGTGAG 
ATGGCCCAAG AGGCTCCGCG TCGGCTTGAG CCGAGTCCGT TCGAGTGGAA GGACCAAACA
GGTCCAGCCG TGAAGACCGC AATACCCGGC GCCATACCAA CCGTGGCAAT CGATGTTGCC
AGCGTAACAA AGTCCTACGG TGACAAACCT GTAATCAACG GACTGTCGTT CACCGTTGCA
GCGGGTGAGT GCTTCGGTCT GTTAGGTCCC AACGGTGCAG GCAAAAGTAC GATCACCCGT
ATGATCCTCG GCATGACGAC GCCTGGTACG GGTGAGATCA CCGTGCTCGG CGTGCCGGTT
CCGTCACGGG CTCGATTGGC ACGCATGAGG ATTGGCGTAG TTCCGCAGTT CGACAACCTC
GACCTGGAAT TCACTGTACG CGAAAACCTG TTGGTCTTCG GGCGCTACTT CCGGATGAGC
ACGCGCGAGA TAGAAGCGGT AATCCCATCG CTCCTTGAGT TTGCGCGCCT CGAAAACAAG
GCGGATGCGC GTGTTTCGGA CCTGTCTGGC GGCATGAAGC GGCGCCTTAC ACTGGCACGT
GCCCTCATCA ACGATCCCCA GCTACTGATA TTGGACGAGC CTACCACTGG ACTTGACCCG
CACGCCCGTC ACTTGATCTG GGAACGGCTG CGGTCGTTGT TGGCACGCGG AAAGACGATT
CTCTTGACCA CCCATATTAT GGAAGAGGCA GAGCGGTTGT GCGACCGGCT GTGCGTGCTC
GAAGCAGGGC GCAAGATCGC CGAAGGCCGA CCTCACATGC TAATAGACGA GAAGATCGGT
TGCCAGGTGA TAGAGATCTA CGGGGGCGAT CCACACGAGC TAAGTGCGTT GGTAAGCCCG
CACGCCCGCC ACATCGAGGT GAGCGGCGAG ACCGTCTTCT GTTATGCGTT CGACCCGGAG
CAAGTACGAG TCCAACTGGA TGGGCGCGCG GGTGTGCGCT TTCTGCAGCG TCCACCAAAT
CTCGAGGACG TTTTCTTACG GTTGACCGGG CGGGAGCTGA AGGACTGA
 
Protein sequence
MTGNGRVLRH EAENQLSDRE MAQEAPRRLE PSPFEWKDQT GPAVKTAIPG AIPTVAIDVA 
SVTKSYGDKP VINGLSFTVA AGECFGLLGP NGAGKSTITR MILGMTTPGT GEITVLGVPV
PSRARLARMR IGVVPQFDNL DLEFTVRENL LVFGRYFRMS TREIEAVIPS LLEFARLENK
ADARVSDLSG GMKRRLTLAR ALINDPQLLI LDEPTTGLDP HARHLIWERL RSLLARGKTI
LLTTHIMEEA ERLCDRLCVL EAGRKIAEGR PHMLIDEKIG CQVIEIYGGD PHELSALVSP
HARHIEVSGE TVFCYAFDPE QVRVQLDGRA GVRFLQRPPN LEDVFLRLTG RELKD