Gene Smed_1895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1895 
Symbol 
ID5322753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1961527 
End bp1962903 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content62% 
IMG OID640790832 
ProductHI1409 family phage-associated protein 
Protein accessionYP_001327564 
Protein GI150397097 
COG category[S] Function unknown 
COG ID[COG3567] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01555] phage-related protein, HI1409 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAACA TTATCGCGTT CGTACGCGAC AGCCTGACAA ACATAGTCGC CAGCCTGGGT 
ACCAGCCGGG ACAAGGCAGC GGCTAACGTC TATTCGATGC CAATGCTCAC CGACGAGGAA
CTGCTCAACG CCTATCGCGG CGCGTGGCTC CCGAAGAAGA TCGTAGACAT CCCGGCATTC
GACAGCATCC GTGCGTGGCG CGATTGGCAG GCGAAGAAGC CGCAGATCGA AGCGATCGAG
GCCGAAGAGA AGCGGCTGAA CGTTATGGGC AAGCTGCTGG AGACCCGCAT CAAAGCGCGA
CTCTGGGGCG GCGCTGCTCT CGTCATCGGT ACCGGCGACA AGGACCTGAC GGCGCCGCTC
GACGTCGAGC GCATCACGAA GGGCGGCCTG AAATACCTCA CGGTCATGAC GCGCCGCCAC
CTCACCGCCG GCAAGATCGA AAGAGATCCG GCGTCGGAGT GGTATGGCAA GCCGAAGGTC
TATCAGCTGA ACTCGGCCGA TGGCGCGCAA ATCGAAATAC ATCCGTCGCG CCTGGTCATC
TTCAACGGCA GCCAGCAGCC GGACGAGGAC ATCGTAACGA CCACCTATGC CGGCTGGGGC
GACAGCGTCC TCTTGTCGGT GGTCGATGCA ATCAAGCAGG CCGACGGTAC CGCGGCGAAC
ATTGCCAGCC TCGTTTTCGA GGCGAAGGTC AACGTGATCC GTATTCCGGA TTTCATGCAG
AACCTCGGTA ACGCAGAGTA CCGCGCCAAG ATCCTCGAGC GCTATACGCT TGCGGCGACG
GCAAAGGGCA TAAACGGCGA CCTGCTGCTC GACAAGGAAG AGGAATACGA GCAGAAGACG
GCCAGCTTCG CCACGCTGCC AGAAGTCCTG ATGTCGTTCC TGCAGATCGT CTCCGGCGCC
GCGGACATTC CGGCTACCAG ACTTCTCGGC CAGTCGCCGG CCGGCATGAA CGCCACCGGC
GAAAGCGACC TGCGCAACTA TTACGACCGC TTACAGGCTA TGCAGACCGT CGAGATGACG
CCGGCGATGG CGCGCCTCGA CGAGTGCCTG ATCCGGAGCG CTCTCGGCTC TCGCGACCCG
GACATCTACT ACGACTGGGC GCCGCTCTGG GGCATGTCGG AGAAGGAAAA GGCCGACGTC
TTCAAGACGA AAGCGGACGC GGCTCGGCAG TTGGTCGGAA GCGGTACGGG ACAGGAGATC
ATCCCTCGCG ATGCCGTTTC CGATGCTCTG GTCAACACCT TCATCGAAGA CGGCTCGCTG
CCCGGTCTCG ATGCAGCGAT CGAGGAGTAC GGCAAGCTTT CTGAACAGGA GCCGGATGAG
GAAGAGCGCG CCGCGGCAGC CACACAGACA TCTGCAGCAA TGAATCCGAG CGGCTGA
 
Protein sequence
MANIIAFVRD SLTNIVASLG TSRDKAAANV YSMPMLTDEE LLNAYRGAWL PKKIVDIPAF 
DSIRAWRDWQ AKKPQIEAIE AEEKRLNVMG KLLETRIKAR LWGGAALVIG TGDKDLTAPL
DVERITKGGL KYLTVMTRRH LTAGKIERDP ASEWYGKPKV YQLNSADGAQ IEIHPSRLVI
FNGSQQPDED IVTTTYAGWG DSVLLSVVDA IKQADGTAAN IASLVFEAKV NVIRIPDFMQ
NLGNAEYRAK ILERYTLAAT AKGINGDLLL DKEEEYEQKT ASFATLPEVL MSFLQIVSGA
ADIPATRLLG QSPAGMNATG ESDLRNYYDR LQAMQTVEMT PAMARLDECL IRSALGSRDP
DIYYDWAPLW GMSEKEKADV FKTKADAARQ LVGSGTGQEI IPRDAVSDAL VNTFIEDGSL
PGLDAAIEEY GKLSEQEPDE EERAAAATQT SAAMNPSG