Gene Smed_5805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5805 
Symbol 
ID5320107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp776359 
End bp777600 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content62% 
IMG OID640777507 
ProductHipA domain-containing protein 
Protein accessionYP_001314439 
Protein GI150377844 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0420791 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATG CCCGCCCCGC CGAAACCGTC ATGTCTCTCG ATGTTCTCCT CAATGAGGTG 
AAGGTCGGCA CGATCGTCAG GACACCGGGC GACTTCAACG CCTTCAGCTT CGATCCTGCC
TACCGCGCGA GCGGAGGGCA TCCTGTGTTG AGCTTGTCCT TCCGCGCGGC GACGGGAGGA
TTGCGCAAGG ATCCACGCCC GGTGGCGGGA GCTTTGCCGG CCTTCTTCGC CAACCTCCTG
CCCGAGGACA AACTGCGCGA GGCGATGGAA AAGCATCACG CCCGGGCCGT CCGGCCCGGC
AATGATTTCG ATCTGCTGGC CGCGTTGGGC GCCGACCTGC CAGGCGCTGT GCGGGTTGTC
CCGAGTAACG GGGATATCGT CGCCGCAACG GCTGCAGTTG AGGGCGAGCC GAAAGCGCGC
TTCTCGCTCG CTGGCGTGCA GATGAAGCTT TCGGTGATGA AGAACACCGG CAAGGGCGGC
GGATTGACAG TGCCGCTCGG GGACAGCGAC GGTCAGTACA TCGCCAAATT TCCTTCGACC
GCCTTTCCAG GCGTGTCGGA GAACGAGTTC GCCAATCTCG CGCTGGCGGA AGCGATCGGC
ATGGACGTGC CCGAACGCGA GCTGGTCGGC AAAGACCAGT TCGAAGGCAT TCCCGAAGAG
TTCGAGACCC TGGCCGAGGG GCTGGTCCTT CTTATCAGGC GGTTCGATCG CGCCGACGGC
GCGCGACGAA TCCATATCGA GGATTTCGCC CAGGTCTTCG GCATCTATCC GGCACGAAAA
TACGAAGGCG CGGCCTATCA TGACATAGCC GCGGCGCTGA ATGTGGCGAT CTCGCCGGTA
GCGGCGCTGG AGTTCGTCCG CCGCCTGACC CTGTCGGTGG TCATGGGCAA TGGCGACATG
CACCTGAAAA ACTGGTCACT GATCTATCCC GGTGACGGTA ATTCGCCCGC CATGGCGCCG
ATCTACGACG TGCTTTCGAC GGTCCCCTAT ATTCCGGCTG ACAATCTGGC TTTGTCACTT
GGCGGCGAGC GCGCGTTCAA GGCACTGACG CTGCCACGGT GGAAGGCTTT CGCCAATCGG
GCGCGGCTTC CAGAACCCGC CGTCCTCAAA ACTGTCCGCG AGACGGTCAA GCAAATCGAT
GCGCATTGGT GGAAACTGCC GGAACGTGAT GCTGTCCCTT CGATCGTTCT GGAGCGGATC
GACGCCCATG TGAGGGCCAT GATGCCTGTC CTCTCGGATT GA
 
Protein sequence
MNNARPAETV MSLDVLLNEV KVGTIVRTPG DFNAFSFDPA YRASGGHPVL SLSFRAATGG 
LRKDPRPVAG ALPAFFANLL PEDKLREAME KHHARAVRPG NDFDLLAALG ADLPGAVRVV
PSNGDIVAAT AAVEGEPKAR FSLAGVQMKL SVMKNTGKGG GLTVPLGDSD GQYIAKFPST
AFPGVSENEF ANLALAEAIG MDVPERELVG KDQFEGIPEE FETLAEGLVL LIRRFDRADG
ARRIHIEDFA QVFGIYPARK YEGAAYHDIA AALNVAISPV AALEFVRRLT LSVVMGNGDM
HLKNWSLIYP GDGNSPAMAP IYDVLSTVPY IPADNLALSL GGERAFKALT LPRWKAFANR
ARLPEPAVLK TVRETVKQID AHWWKLPERD AVPSIVLERI DAHVRAMMPV LSD