Gene Smed_5500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5500 
Symbol 
ID5319802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp466816 
End bp468111 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content65% 
IMG OID640777255 
Productcytochrome c class I 
Protein accessionYP_001314187 
Protein GI150377592 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGA TGCTCGCGAT ACGCTGCCTG GCGGCAGCTG CTTCCTTCCT CTTCGGCGCG 
ACCGCGCATG GGGCAGAACT GCGCGGCCAT GGAGGCCCGG TGCGTTCAAT CGCCATAGCC
CCTGACGGAC AGACAGCGAT CACCGGCAGT TTCGACGCCA AGGCGATTAT CTGGTCGCTT
GAAACGGGCG AGGCGCAGCA GGTGCTGCTG TTTCACGAAA GCCAGGTCGA TGCCGTCGCC
GCTCTGCCGC AGGGGCGTTA CGCCACCGCC GGGGCCGACG GCCGTATCGC AATCTGGGAG
GCCGGACGCA GCACGCCGGT ATCCGTGTTG CATGGCCATG ACGGACCGGT TGTCGCACTG
GCAGTCGCGC CCGACGGCTC TACGCTCGCT TCCGCCTCCT GGGATGCGAG CGTGCGTCTG
TGGCCGCTTT CCGGCGCTCC GTCGCGCATC CTGAAAGGCC ACCATGGCAA TGTTAACGCG
GTCACCTTCC TGTCCGACGG AACGCTTGCA AGTGCGGGCT ACGACGCAGC CATCATACTA
TGGCCGCCGG GACATGACGC TGCGCCGATG CGGATATCCA TGCCGGGGCC GCTCAACGCC
CTCGTGACGG TGCCGGGCGA CCGCCTGCTT GCAGCAGGCG CAGACGGGAC ATTGCGCCAG
ATCGACCGAA GGGGCGCGAT TGTGGCCGAG GTCAGGGTGT CCACCGGTCC CCTGATCGCG
CTAGCGGCTA CGGCTGATAA GAGATACATC GCCGCCTCGG CCATAAGGGA GGGCATCGTG
TTGCTCGACT CCCGCACCCT GAAGCCGGTC AACACCTTCG GCGGTGCCGG CGTTGCGACG
GTGTGGGCCC TTGCCTTTGC ACGCGGTGAG CGGACGCTGT TGACCGGCGG TGCGGACACC
ATCATAAGCG AATGGGATGT CGAAACCGGC AGGCGGCTTG GAACCTCAGC CGCGATCCAG
GCCGATCTCA TGAGCGAATA TGCCGGCAAT CCGGACGCCG AAATCTTCCG TGCCTGCATC
GCCTGCCATA CGCTCGGCCC GGAGGACGGC AATCGCGCCG GTCCGACGCT GCACGGCATC
TTCGGCCGTA AGATCGCGGC GCTTGCCGAC TATCCCTACT CCCCCGCTTT CCGGCGAATG
GATATCGTCT GGACGCCGGA AACGGTGTCG AAGCTCTTTG AACTGGGGCC AAGCATATAT
ACGCCAGGGA CCAAGATGCC GAACCAGACG ATCAACGATC CCGAGGATCG TGCAGCGCTG
ATCCGCTTTT TGCAGTCCGA GACCCGCCGC GACTGA
 
Protein sequence
MKAMLAIRCL AAAASFLFGA TAHGAELRGH GGPVRSIAIA PDGQTAITGS FDAKAIIWSL 
ETGEAQQVLL FHESQVDAVA ALPQGRYATA GADGRIAIWE AGRSTPVSVL HGHDGPVVAL
AVAPDGSTLA SASWDASVRL WPLSGAPSRI LKGHHGNVNA VTFLSDGTLA SAGYDAAIIL
WPPGHDAAPM RISMPGPLNA LVTVPGDRLL AAGADGTLRQ IDRRGAIVAE VRVSTGPLIA
LAATADKRYI AASAIREGIV LLDSRTLKPV NTFGGAGVAT VWALAFARGE RTLLTGGADT
IISEWDVETG RRLGTSAAIQ ADLMSEYAGN PDAEIFRACI ACHTLGPEDG NRAGPTLHGI
FGRKIAALAD YPYSPAFRRM DIVWTPETVS KLFELGPSIY TPGTKMPNQT INDPEDRAAL
IRFLQSETRR D