Gene Smed_2296 details

Gene Information

Locus tag: Smed_2296
Symbol:
ID: 5323157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2372143
End bp: 2374347
Gene Length: 2205 bp
Protein Length: 734 aa
Translation table: 11
GC content: 61%
IMG OID: 640791234
Product: TonB-dependent hemoglobin/transferrin/lactoferrin family receptor
Protein accession: YP_001327963
Protein GI: 150397496
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01785] TonB-dependent heme/hemoglobin receptor family protein
            [TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein
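
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon,
    which is an assumption about how this record reports positions.
    """
    gene_len = end_bp - start_bp + 1          # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


if __name__ == "__main__":
    # Coordinates taken from the record above.
    print(cds_lengths(2372143, 2374347))
```

Under these assumptions the coordinates in this record reproduce the reported 2205 bp and 734 aa.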


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.234344
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCTCAACC GGCATCATCG CCTGGCGCTT TTGGCTTGCA CGGCATTCGT TACCCTCGTC 
GAAACGACAC TTTCTCACGC GCAATCCACC GAGGAGGCGG CAGGTGCCGC CGAAAAGCAG
GGGCGCGTAA CAGTGCTGAA GAAACTTTCC ATCCAAGGCG GCGAGAAGGA GGGTGTGGCC
GACACGCCGC TCGCAGAACA AGTGACCGAG GAAGAACTCG ACCGCAATCA GATCACGAGC
TTCGAAGACC TGGGCAGAAG CCTCGAACCT GGCGTGAACT TCAACCGGAC GAGCGGCAGC
GTAAATATAC GCGGCCTCGA AGGGCCGCGC GTGCTGACGA CGATCGACGG CATTCCCATC
CCCTTCATCG ACGATGGCGC GCGCGATGCC GATGGCGGCA TAGACAGCTT CGATTTCTCG
GCGCTCTCGA CCATCGACAT CGTTCGCGGC GCGGATTCCA GCCGCGCGGG CGGCGGTGCT
CTCGGAGGAG CCGTGGTGCT GCGCACGCTC GAACCGGAGG ACCTGATCGG CGAGGGCAAG
ACGTGGGGCG GCATCTTCAA ATTCGCCTAT GACGGCGAGG ACGACAGCCT CGGGGGCTCC
GCTGCCGTTG CTGCGCGCTA TGACAACACG GCGGCCCTGT TCCAGGGGGG GTACAGGAGG
GGGCATGAAC GGGAGAGCAA CGGCGACGTC GGCGGCTATT CCACGACGCG CACCGAGGCC
GACCCGGCTG ACTACGACCA GAACAATCTC CTGTTCAAGG TGCGGCAATA TACCGAAAGC
GGCCATACTT TCGGCTTCAC GGCCGAGCGC TTCGACCGCG ACAAGGATAT CGACTTCATG
TCCGGCCAGA GCCTCTTCGG CAATTATCGG CCGGGCGATT ACGACAAGCT TGATGACACC
CGGCGCGAGC GTCTTTCGCT CGATTACGAA TTCGAAGCCA CCGACGACAA CGCTTGGTTC
GACAGCGCTT CGGCGACGGT CTACTGGCAG CGGCTGTTGC GCGAAAACGG CGTCGACGCC
TACCGCAGCA CGTCGGTGAT CGGCGACTAT TCGCGTCTCA ACGAAGCGGA AGAAGAGGGC
TTCGGCGCCA GCGGTTATGT CGAGAAATTC TTTGAAACCG GTAACCTGCA GCACCAGGTG
ACGCTCGGCG GCGACTTTGC CATCGGCAAG CTGCATCAAT ATTCCTCGGG CGAGGATAGC
TGCGACACGG CGCCGAGCCC ATCATGCAAT TTCCTGCATA CCAACCAGTC TGACAGCCCC
GACGTCGACA GCAGGAAAAT CGGCGTCTAC TTGCAGGACC GCATCTCGAT CGGCGACGGA
CCCTTCGCGC TGACGCCGGG GCTTCGCCTC GACTGGTACG ACTACGCGCC TGAGAACACG
GCCGCCTACA TGCGCAACCC CAACTATGAC GGTTTGCCGC CGGGACAGAG CGACCTCGCG
CTGTCGCCGA AGCTGCTTGC AACCTATCAG GCAGCGGAGG AGGTCGAACT CTTCGCGCAA
TGGGCAATGG CTTTCCGGCC TCCGACCGCC GAGGAGCTCT ATCTGAATTA CGGCGCACCG
GGCAGCTATC TTCAAATAGG CAATCCCGAC CTGAAGCCGG AAACGAGCCA GGGTTTCGAG
ATCGGAGCCA ATCTCGGCGA TGAGGAATTC GGCGGTCGCA TCGCTGCCTT CTACAACAGA
TACAGAAATT TCATCGATGA GCGCTGGAGT TTCGATCCGA CCGGCACCTA CCCGATCGGC
ATCACCGAAG CCATCAACCG CGCCAATGTC TCCATCCATG GCGTCGAGGT CTCGGGGCAC
AAGGTTTTCG CCAATGGTGT CCACGTGCGT ACCGCGCTCG CCTACGCTCA CGGCCGCGAT
CTCGATACCG ACGAAGTCCT TGGTTCTGTT GCCCCGTTGA AAGGCGTCCT CAATGTCGGC
TACGCCGCAG AGACCTGGGG AACCGATCTC ATCGTGACAG CCGCGATGGA CGTGTCCGAC
AAATCGACGA ACAGTTTCAA GGCGCCGGGC TATGGCATCG TCGATGTCAC CGGCTGGTGG
GAGCCGGAGC AAATGCCGGG TCTCAGAGTA AATGCCGGCG TCTACAACAT CTTCGACAAG
ACCTATTGGG ACGCGGTGAA CACGCAGAAC GCGTTCACGC AGCCCGCCGA CTTCTATTCG
GAGCCGGGGC GCACTTTCAA GGTCTCGCTG ACACAGCGGT TCTGA
 
Protein sequence
MLNRHHRLAL LACTAFVTLV ETTLSHAQST EEAAGAAEKQ GRVTVLKKLS IQGGEKEGVA 
DTPLAEQVTE EELDRNQITS FEDLGRSLEP GVNFNRTSGS VNIRGLEGPR VLTTIDGIPI
PFIDDGARDA DGGIDSFDFS ALSTIDIVRG ADSSRAGGGA LGGAVVLRTL EPEDLIGEGK
TWGGIFKFAY DGEDDSLGGS AAVAARYDNT AALFQGGYRR GHERESNGDV GGYSTTRTEA
DPADYDQNNL LFKVRQYTES GHTFGFTAER FDRDKDIDFM SGQSLFGNYR PGDYDKLDDT
RRERLSLDYE FEATDDNAWF DSASATVYWQ RLLRENGVDA YRSTSVIGDY SRLNEAEEEG
FGASGYVEKF FETGNLQHQV TLGGDFAIGK LHQYSSGEDS CDTAPSPSCN FLHTNQSDSP
DVDSRKIGVY LQDRISIGDG PFALTPGLRL DWYDYAPENT AAYMRNPNYD GLPPGQSDLA
LSPKLLATYQ AAEEVELFAQ WAMAFRPPTA EELYLNYGAP GSYLQIGNPD LKPETSQGFE
IGANLGDEEF GGRIAAFYNR YRNFIDERWS FDPTGTYPIG ITEAINRANV SIHGVEVSGH
KVFANGVHVR TALAYAHGRD LDTDEVLGSV APLKGVLNVG YAAETWGTDL IVTAAMDVSD
KSTNSFKAPG YGIVDVTGWW EPEQMPGLRV NAGVYNIFDK TYWDAVNTQN AFTQPADFYS
EPGRTFKVSL TQRF
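
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence with translation table 11. The sketch below is one way to do that in plain Python, assuming the sequence is pasted in the space-grouped layout shown above; the helper names `clean`, `translate` and `gc_fraction` are hypothetical. Table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGCTCAACC GGCATCATCG CCTGGCGCTT"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, or passing it to `gc_fraction` and comparing with the GC content field, gives a quick consistency check on the record.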