Gene Smed_4463 details


Gene Information

Locus tag: Smed_4463
Symbol:
ID: 5317933
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 943842
End bp: 945164
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 63%
IMG OID: 640776264
Product: HipA domain-containing protein
Protein accession: YP_001313197
Protein GI: 150376601
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID:
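
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(943842, 945164)
print(length)                  # 1323
print(protein_length(length))  # 440
```

Both results match the Gene Length and Protein Length fields of the record.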


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTCTA AGGCGGATGC CACCGAAGCC TTTGTCTGGA TTTGGCTACC GGGCGCAACA 
GAACCGGTTG TGGCAGGCCG CCTCTTCCAA GAGGGCGAGC GCCTGCTCTT TAACTACGGC
GCCAGCTACC GCGGCCGAAA GACCGCCATT CCTATCTATG AGCCCGAACT TCCCCTTCAG
GAAGGCGCGA TTGCGCCCAT CAACGGCTTG CAGATGGCGA GCTGCATTCG TGACGGATCG
CCCGATGCGT GGGGCCGCCG CGTGATCATC AACAGGCTGA CGGGCAAGAA GCCCGACGCT
GACGGCGTGC CGGAGATCAG CGAGCTTACC TACCTGCTCC ATTCAGGCTC CGACCGGACG
GGCGCCCTCG ATTTTCAGGC ATCGGCGACA GAATATGTCC CCCGCCGTGC CGCGCAGGCA
TCGCTCGACG AACTCATGGA AGCGGCCGCC CTCGTCGAAA AGGGCGTCCC TTTGACCCCG
GCGCTCGCCC AGGCCCTCAA CCACGGCACC TCGATCGGCG GCGCGCGCCC CAAGGCGCTG
ATCAACGACG ACACAAAGAA GTTCATCGCT AAATTCTCGG CGAGCAACGA CACCTACAGC
GTCGTGAAGG CTGAATTCAT CGCGATGAGG CTGGCGAGCG CCAGCGGGCT CGACGTGGCA
TCCGTATCGA TAACCCGTGC CGCGCATAAG GACGTGCTGC TGATAGAGCG GTTTGACCGC
AGGCACACCA AGGAGGGCTG GACGCGGCAG GCGATGGTCT CGGCGCTGAC GATACTGGGC
CTTGATGAGA TGATGGCCCG CTATGCTTCT TATGAGGATT TGGCGGAGCT GATCCGCCAT
CGCTTCACTG CCCCCAAGGA TACGCTCAAG GAGCTCTACG GGCGTATCTG CTTCAACGTG
CTGTGCGGCA ACACCGACGA CCATGCCCGT AACCACGCGG CATTTTGGGA CGGCAAGATG
ATGACACTGA CGCCCGCCTA TGACATCTGC CCGCAAAGCC GCACGGGCAC CGAAGCCACA
CAGGCCATGC TGATCAAGGG CGAGGGTCGC GCCAGCACGC TGGCGAATTG CCTTGCGGCC
GCGCCGGATT ACCACCTGAA AGAGGCGGAC GCAACCGCGC TGATCGAACA CCAGATCACA
ACCATCGCCG AACAATGGCA GGCGGTCTGC GCGGAAGCCG AACTGACCCC GGTCGACCGC
CAGTTTTTCG CCGGACGCCA GTTTCTCAAT AACTACGCCA TAGAGGGGCT CGAAGGCCAC
AAGGCGCTGC ACGATGCGTT TGGTGCCGCC CGGAAGGCGC TGATCGCAAG CGGAGACGCC
TGA
 
Protein sequence
MTSKADATEA FVWIWLPGAT EPVVAGRLFQ EGERLLFNYG ASYRGRKTAI PIYEPELPLQ 
EGAIAPINGL QMASCIRDGS PDAWGRRVII NRLTGKKPDA DGVPEISELT YLLHSGSDRT
GALDFQASAT EYVPRRAAQA SLDELMEAAA LVEKGVPLTP ALAQALNHGT SIGGARPKAL
INDDTKKFIA KFSASNDTYS VVKAEFIAMR LASASGLDVA SVSITRAAHK DVLLIERFDR
RHTKEGWTRQ AMVSALTILG LDEMMARYAS YEDLAELIRH RFTAPKDTLK ELYGRICFNV
LCGNTDDHAR NHAAFWDGKM MTLTPAYDIC PQSRTGTEAT QAMLIKGEGR ASTLANCLAA
APDYHLKEAD ATALIEHQIT TIAEQWQAVC AEAELTPVDR QFFAGRQFLN NYAIEGLEGH
KALHDAFGAA RKALIASGDA
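
Both sequences are printed in space-separated groups of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, plus a GC-fraction helper that could be used to compare against the GC content field. The helper names are hypothetical, and the example uses only the first row of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, copied as displayed (six groups of ten bases).
first_row = "ATGACTTCTA AGGCGGATGC CACCGAAGCC TTTGTCTGGA TTTGGCTACC GGGCGCAACA"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Passing the complete gene sequence block to `clean_sequence` should yield a string whose length equals the listed gene length, which is a quick check that nothing was lost in copying.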