Gene Smed_4400 details

Gene Information

Locus tag: Smed_4400
Symbol:
ID: 5319165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 896185
End bp: 897462
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 66%
IMG OID: 640776204
Product: hypothetical protein
Protein accession: YP_001313137
Protein GI: 150376541
COG category: [S] Function unknown
COG ID: [COG1415] Uncharacterized conserved protein
TIGRFAM ID:
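The coordinate and length fields in this record are mutually consistent, and the arithmetic can be sketched as below. This is a minimal illustration, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(896185, 897462)
print(length)                  # 1278
print(protein_length(length))  # 425
```

Both values match the Gene Length and Protein Length fields listed in the record.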


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.901272
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.333483
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCAAA GGGCAGGTAA TGCGGATCTC CCGCTGCATG GCGGCCGGGT GCCGCGGTGG 
CTCGGCGATC GCATGACGCG CTTGGGCGCG CTGGTCACCG AAGCGATCGT ACATCACTAT
GGACGCGACG AGTTTTTGAG GCGCCTGGCG CATCCGTTCT GGTTCCAGTC CTTCGGTGCC
GTAATGGGAA TGGACTGGCA TTCGTCCGGG ATCACCACGA GCGTCATCGG GGCGCTGAAA
CGCGGCCTCA CGCCCCTTGC CGGCGAGCTC GGCATCCATG TTTGCGGCGG CCGCGGCCAG
CACTCCCGCA AGACGCCCGG CGAACTCGTC TCGATCGGCG ATCGCATCGG TTTCGACGGC
GGCGCAATGG CGGAGGCGAG CCGACTCGTA GCAAAAGTGG ACAGTGCCGC CGTTCAGGAC
GGCTTCGACC TCTACCTGCA TGGCTTCATC ATCACGGACG ATGCCAAATG GGTGGTCGTC
CAACAGGGCA TGAACGGCGA CCGGCGCCAG GCGAGGCGCT ATCACTGGCT TTCCGAAGGG
TTGACGAGCT TCGTCGATGC GCCGCACAGC GCGATAGAGG GCAGAGGACA GGGCGAAATC
TTCAATCTCG CAGACCGCCG GGCTGCCGCG TCGCGGAGTG CGCAGCTCGA TCTCCTCCAC
TCACTCGGGC CCGACGGACT CTTGCGTGAG GTCGCCTCGA TCGAGGCTCG CGCTGCTCCT
CAGGCAGAGC CGGCACAGCC GCTGCTGCCG CATCTCTTCA TGCCCGCCCA TCACGAGGTT
CGTGAATCCG ACGTCAATCT CCGGCGCCTT CACGGCAGCT TCGCCGCGGC CGCCGAGCGC
GGACCTGAAG ACTTCAAGGA CCTGCTCCTC GTGCCGGGGG TCGGGGCCCG GACGGTCAAA
GCACTGGCGA TGGTCGCGGA GGTCGTTCAC GGAACGCCGT GCAGGTTCTC CGATCCCGCC
CGCTTTTCGC TCGCCCATGG CGGCAAGGAC CGTCATCCGT TTCCGGTTCC GTTGAAAGTT
TATGACGAGA CTATCGGCGT CATGAAGTCC GCGGTGAGTA AGGCCCGGCT CGGGCGCGAG
GAGGAGCTTG CGGCGCTGAA GCGACTTGAC GAGCAGTCGC GACGGCTGGA ACGCTACGTC
ACCGGCCCTG ACCTCAAGGA GATCGTCGCG GGCGAATTCA GGGACTCCGC GCGTTTCGGC
GGGCGCAGCA TCTTCGGCTG GGAACCGCCC GAGGAAGAAA CGATCATTTC CGAGCCGGGC
GACCGCGCGC GGCGTTGA
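The gene sequence above is printed in space-separated blocks of ten bases, so the spacing has to be removed before computing a statistic such as GC content. The following is a minimal sketch of that step; the helper name `gc_content` and the toy input are illustrative assumptions, not taken from the record.

```python
def gc_content(grouped_sequence: str) -> float:
    """Fraction of G and C bases in a sequence printed in spaced blocks."""
    # Drop the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Toy input (hypothetical, not from the record): 6 of 8 bases are G or C.
toy = "ATGC GGCC"
print(gc_content(toy))  # 0.75
```

Passing the full gene sequence block to the same function gives a fraction that can be compared with the GC content field in the Gene Information section.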
 
Protein sequence
MAQRAGNADL PLHGGRVPRW LGDRMTRLGA LVTEAIVHHY GRDEFLRRLA HPFWFQSFGA 
VMGMDWHSSG ITTSVIGALK RGLTPLAGEL GIHVCGGRGQ HSRKTPGELV SIGDRIGFDG
GAMAEASRLV AKVDSAAVQD GFDLYLHGFI ITDDAKWVVV QQGMNGDRRQ ARRYHWLSEG
LTSFVDAPHS AIEGRGQGEI FNLADRRAAA SRSAQLDLLH SLGPDGLLRE VASIEARAAP
QAEPAQPLLP HLFMPAHHEV RESDVNLRRL HGSFAAAAER GPEDFKDLLL VPGVGARTVK
ALAMVAEVVH GTPCRFSDPA RFSLAHGGKD RHPFPVPLKV YDETIGVMKS AVSKARLGRE
EELAALKRLD EQSRRLERYV TGPDLKEIVA GEFRDSARFG GRSIFGWEPP EEETIISEPG
DRARR