Gene Smed_1564 details

Gene Information

Locus tag: Smed_1564
Symbol:
ID: 5322422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1658206
End bp: 1659939
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 62%
IMG OID: 640790509
Product: ankyrin
Protein accession: YP_001327241
Protein GI: 150396774
COG category: [R] General function prediction only
COG ID: [COG0666] FOG: Ankyrin repeat
TIGRFAM ID:
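
The start, end, and length fields above can be cross-checked against each other. The following is a minimal Python sketch of that arithmetic. It assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1658206
end_bp = 1659939
gene_length_bp = 1734
protein_length_aa = 577

# Assumption: coordinates are 1-based and inclusive, so both end points count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is excluded from the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```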


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.991027
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0684375
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACGC GTGATCTTCC AGCCAACGCC AGTCTCGAAT CGATCCGCAA GCAGGCAAAG 
GCCCTTCTCA AAGCGTTCAA GGCGGGAGAC GGCGCCACTT TTCAACGCGT CAAACCCTAT
TTCGCAGATC CGCATGCCAT AGGGTTGCAG GAGGTTCAGC TTGTGCTGGC GCGTGAATTC
GGTTTCTCCG GTTGGACTAA GCTGAGGGCA CACCTGACGG GGACCCGCGG GGGCGAACAC
TCGGCTGAAC GGTTGGCCAA AAGGTTTCTT TCGCTCGCTA CACTGTCCTA CTTCGGCGAA
GTTCCGGCCG ATCCCGAGCG CTTCGGCGAA GCTCTGGAAC TATTGCATGC GCATCCGGAA
ATCGCCGATG AGAGCATTCA TGTCGCTGCG GCTCTCGGCG ATGCCGCCGC TGTCGGCCGT
TGGCTCGACA GCGAACCGCA GCTGATCGGA TACAAAGGCG GGCCCTACGA TTGGGAGCCA
TTGCTCTATG CAGCCTATGC CCGAATACCC GGCAGATCGT CGTTCGAAGC AGCGCGCCTT
CTGATCGGGC GCGGCGCAGA TCCGAACGCT TTCTGGCTCG ATGATGGGCA GTACCGCTTC
ACTGCGCTGA CAGGGGTGTT TGGACAGGGC GAGGCGGGAA AGGAACGGCA GCCGGAGCAT
CCCGACCGCC GTGCCTTCGC CCGGCTGCTT CTGGATGCGG GCGCCGAACC GAACGACAGC
CAGGCGCTTT ACAACTGCAT GTTCGAACCG GACAACACCT GCCTCTCGCT GCTGCTCGAG
TACGGCTTGA AGCCGAGTGA TCGAAACAAC TGGCTCCTGC GCGAGGATGG TCGCCTCGTC
GCCAATAGCG AAAGGGTCTT TGACTATCAG CTGGCCTGGG CGCTGGAGAA GAGAATGCCG
GAGCGTGTCC GACTTCTGGT GGAACACGGA GCCGATGTGA ACCGGATCGT TCGGGGCCGA
AGCCCCTATG AATGGGCGAA GCTCGGCGGC GATGATGCTC TCGCCGACTT CCTTGCGTCG
CGCGGTGCAC GGCGCGTGGA GCTGTCTTAC ATCGATCGGC TGATTCGGGC GATTGGCGCT
GAGCGGCACG ATGAGGCCAT GGAGTTGGTG TGCGCGAGAC CGGATCTTGT AGTGCACGTG
GAAGAGGCAC ATCCCTCACT GCTGCATGAG GCGGCGGGTG ACGGTCGGCA TGGCCAGGTT
TCGCTGATGC TGGCGCTTGG ATTCAACGTC AACCGGATGA CGTCCCGCAC GCCTCTTCAT
GAAGCGGCAC TGCACGGAGA CCTCGCGATG GCGAAGCGCC TGATCGAAAA CGCTGCCGAC
CCGACGCTCC GTGACCCGTA TCATCAGGCG CCGCCTATCG GGTGGGCTGA GTATAACGGC
AAGGAGGAGA TGGTGCGCTA TCTCATAACC CAGCCGCTTG ATGTCTTCGC TGCTGCGGCT
TTCGGCAATG CCGAGCGGGT GGGGGCAATG CTCGATGCGC AGCCGGAACA GCTGGAGATG
ACGTTCGGCG AGTTTCGCGG TGGCGGCAGC CCTGACCCGC GGCGCGACTG GATGACACCG
CTTGCCTTCG CGGTCGCAGG CAGCCGCAAG AATGTGGTCA AGCTGCTGAT GGGGCGAGGG
GCAAATCTCG CGCGGCGCGA TCCCGGCGGC CCATCAATCC GGGACCTGGC CCGCGAATCG
GGAGACAAGG AAATCCTGGA TTTGCTGGCC AGCACGGCGG CTGCACGACG GTGA
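
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence can be measured. The sketch below shows one way to do that in Python and to compute a GC percentage for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and only the first row of the sequence is used as a sample.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above, used only as a short sample.
sample = "ATGATCACGC GTGATCTTCC AGCCAACGCC AGTCTCGAAT CGATCCGCAA GCAGGCAAAG"
seq = clean_sequence(sample)

print("bases in sample:", len(seq))
print("GC percent of sample:", round(gc_fraction(seq) * 100))
```

Pasting the full sequence into `sample` gives a length and GC percentage that can be compared with the Gene Length and GC content fields of the record.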
 
Protein sequence
MITRDLPANA SLESIRKQAK ALLKAFKAGD GATFQRVKPY FADPHAIGLQ EVQLVLAREF 
GFSGWTKLRA HLTGTRGGEH SAERLAKRFL SLATLSYFGE VPADPERFGE ALELLHAHPE
IADESIHVAA ALGDAAAVGR WLDSEPQLIG YKGGPYDWEP LLYAAYARIP GRSSFEAARL
LIGRGADPNA FWLDDGQYRF TALTGVFGQG EAGKERQPEH PDRRAFARLL LDAGAEPNDS
QALYNCMFEP DNTCLSLLLE YGLKPSDRNN WLLREDGRLV ANSERVFDYQ LAWALEKRMP
ERVRLLVEHG ADVNRIVRGR SPYEWAKLGG DDALADFLAS RGARRVELSY IDRLIRAIGA
ERHDEAMELV CARPDLVVHV EEAHPSLLHE AAGDGRHGQV SLMLALGFNV NRMTSRTPLH
EAALHGDLAM AKRLIENAAD PTLRDPYHQA PPIGWAEYNG KEEMVRYLIT QPLDVFAAAA
FGNAERVGAM LDAQPEQLEM TFGEFRGGGS PDPRRDWMTP LAFAVAGSRK NVVKLLMGRG
ANLARRDPGG PSIRDLARES GDKEILDLLA STAAARR
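
The protein sequence can be checked against the gene sequence by translating codons. The record lists translation table 11, whose codon-to-amino-acid assignments are the same as the standard genetic code (the two tables differ in which start codons are permitted). The sketch below builds that codon table from the conventional T, C, A, G ordering and translates the first 30 bases of the gene sequence. The function name `translate` is illustrative, and the sketch assumes the input contains only A, C, G, and T.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the block spacing used in the record
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence; compare with the start of the protein sequence.
print(translate("ATGATCACGC GTGATCTTCC AGCCAACGCC"))
```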