Gene Smed_1415 details


Gene Information

Locus tag: Smed_1415
Symbol:
ID: 5322266
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1496470
End bp: 1497540
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 60%
IMG OID: 640790357
Product: basic membrane lipoprotein
Protein accession: YP_001327096
Protein GI: 150396629
COG category: [R] General function prediction only
COG ID: [COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein
TIGRFAM ID:
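
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1496470
end_bp = 1497540

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon, which does not contribute a residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1071
print(protein_length)   # 356
```

Both values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields.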


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.218463
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0969049
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC TACTCGTCGC CCTCGCAACC ACGGTAGCCG TGCTCGGTAT CGCGCCGGCT 
GCAAGCGCTC AGGAGAAGGC AAAAATCTGC TTCATCTACG TCGGCTCGAA GACGGACGGC
GGCTGGACCC AGGCGCACGA CATCGGCCGT CAGGAGCTGG AAAAGGAACT CGGCGACAAA
ATCGAAACGC AGTTCCTGGA AAACGTGCCG GAAGGCCCGG ACGCTGAACG CGCGATCGAG
CGGCTGGCGC GTTCAGGCTG CGGCCTGATC TTCACCACGT CCTTCGGATT CATGGATGCA
ACGATCAAGA TCGCGGCGAA ATTCCCGGAT GTGAAGTTCG AGCACGCGAC CGGCTACAAG
ACCGCGCCGA ACGTTGCCAC CTATAACAGC CGCTTCTACG AAGGCCGTTA CATCCAGGGT
CAGATCGCCG CGAAAATGTC CGAAAAGGGC GTTGCCGGCT ATATCGCCTC CTTTCCTATC
CCCGAGGTAG TGATGGGCAT CAACGCCTTC GTCATCGGCG CCCGGGCGGT CAACCCCGAT
TTCAAGATCA AGGTCGTGTG GGCGAACACG TGGTTCGACC CGGGCAAGGA AGCGGATGCC
GCCAAGGCCC TCATCGACCA GGGCGTCGAT ATCATCACCC AGCACACCGA CACGACGGCG
CCGATGCAGG TCGCGGCAGA GCGCGGTATC AAGGCCTTCG GCCAGGCATC CGATATGATC
GCGGCCGGAC CGCAGACGCA ATTGACCGCC ATCGTCGACA CCTGGGGCGC CTATTACGTG
AAGCGCACCA AGGCGTTCCT CGATGGAACC TGGTCGACGT CTTCGAGCTG GGATGGCCTG
AAGGACGGTA TTTTGACTAT GGCGCCTTAC ACCAACATGC CTGATGACGT GAAAGCAATG
GCGGAAGAAA CAGAGGCGAA AATCAAATCC GGCGAGTTGA AGCCGTTCAC CGGTCCACTC
AACAAGCAGG ATGGCAGCCC TTGGCTGAAA GAGGGCGAGA CCGCCGACGA CGCAACGCTG
CTCGGCATGA ACTTCTATGT CGAAGGCGTC GACGACAAGC TGCCGCAATA A
 
Protein sequence
MKKLLVALAT TVAVLGIAPA ASAQEKAKIC FIYVGSKTDG GWTQAHDIGR QELEKELGDK 
IETQFLENVP EGPDAERAIE RLARSGCGLI FTTSFGFMDA TIKIAAKFPD VKFEHATGYK
TAPNVATYNS RFYEGRYIQG QIAAKMSEKG VAGYIASFPI PEVVMGINAF VIGARAVNPD
FKIKVVWANT WFDPGKEADA AKALIDQGVD IITQHTDTTA PMQVAAERGI KAFGQASDMI
AAGPQTQLTA IVDTWGAYYV KRTKAFLDGT WSTSSSWDGL KDGILTMAPY TNMPDDVKAM
AEETEAKIKS GELKPFTGPL NKQDGSPWLK EGETADDATL LGMNFYVEGV DDKLPQ
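
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline that generated the record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose codon assignments translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). Only the first row of the gene sequence is embedded as sample input; the full sequence can be pasted in the same 10-base block layout.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from the record.
first_row = "ATGAAAAAAC TACTCGTCGC CCTCGCAACC ACGGTAGCCG TGCTCGGTAT CGCGCCGGCT"

print(translate(first_row))            # MKKLLVALATTVAVLGIAPA
print(f"{gc_fraction(first_row):.0%}")  # GC content of this row only
```

The 20 residues printed for the first row match the first two blocks of the protein sequence (MKKLLVALAT TVAVLGIAPA). Running `gc_fraction` on the full gene sequence gives the value to compare with the GC content field.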