Gene Spro_4904 details


Gene Information

Locus tag: Spro_4904
Symbol: yieM
ID: 5603666
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 5442134
End bp: 5443597
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 58%
IMG OID: 640940480
Product: hypothetical protein
Protein accession: YP_001481124
Protein GI: 157373135
COG category: [R] General function prediction only
COG ID: [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
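
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon, which matches the numbers in this record.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of this length.

    Assumes the stop codon is counted in the gene length and is not
    translated into a residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(5442134, 5443597)
print(length, expected_protein_length_aa(length))
```

With the Start bp and End bp values from this record, the sketch reproduces the listed Gene Length (1464 bp) and Protein Length (487 aa).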


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00108067
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000692167
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTCAGCC TGGAAACACT CGATCTGCTG CTGGCGATCA CCGAAGGTGA ACTGATCGAG 
GAGATGATCA TCGGCATGCT GGCGGCACCG CAGCTGTCGA TTTTCTTCAA AAAGTTTCCG
GCGATCCGCC GGGCGCTGGA TCGCGACCTG CCGCGCTGGA AGCTACAGCT GAAAGAACGC
CTGCATGAAG CCATGGTGCC GCCCGCGCTG GCACAGGAAT TCTATCGTTA TCAACAGTGT
CAGCTAGAGA ACAACACCCA GTTCTTCCAC AACCTTAACG ACACCCTGGA TCTGCTGCGT
CAGCTGGTTT CTCCCTTTTA TGAGCAGGCG CGCTCACTGG TGGACGCCGC CGATCTGCCC
AACCATCCAC TGGATGACAG CTTCCAAACC CTGTTTCTGC AGCGCTGGCG TATCAGCCTG
ACGCTGCAAG CCACCATGAT GCATCACCAA TTGCTGGAAC AGGAACGCGA ACAGCTGATG
GCGGAGCTGC AGGAACGTCT GGCGCTGAGT GGCGCCCTGG AGCCGCTGCT GTCGGAAAAC
GACACCGCTG CCGGGCGGCT GTGGGATATG AGTAAAGGCC ATCTGCAACG GGGTGATTAT
CAACGGCTGG TGGAATACGG CAATTTCTTG CAGCAGCAAC CGGAACTGAA AAAGCTGGCA
CAGCAGCTTG GGCGCAGCTA TCAGGCCAAG GCGGTGCAAC AGCAGGACGC GTTGCCGGAG
CCTTTCCGCG TGATGGTGCA GGTGCCGGCC ACGCTACCGG AAGAGGTCAG CGGCATTCAC
CAGAGCGACG ATATTCTGCG CCTGCTGCCG CCCGAGTTGG TCACGCTGGG CATTGAAGAG
CTGGAGTTTG AGTTTTACCG CCGCCTGCTG GAAAAACGGC TTTTGAGCTA TCGTCTGCAG
GGCGATGTCT GGCAGGAACA GATCCATATC CGTCAGGTCA CTCACCAGCA GCAGGATCAA
CAACCTCGCG GGCCGTTTAT TGTCTGCGTG GATACCTCAG GTTCGATGGG CGGTTTCAAC
GAGCAGTGCG CCAAGGCTTT CTGTCTGGCG CTGTTGCGCA TTGCGCTGGC GGATAATCGC
CGCTGCTACA TCATGCTGTT CGCCAACCAG ATAGTGCATT ACGAACTGAC CGCCGCCAGC
GGTATTGAGC AGGCGGTTCG CTTTCTCGGT CAGCATTTTC GCGGCGGCAC CGATCTGGCG
GCCTGCCTGA ATGCCACGGT GAGCAAAATG ACGGAAAGCG GCTGGTTCGA CGCCGACGCG
GTGATCATTT CTGACTTTAT TGCCCAGCGA TTGCCGGAGG AGATAATAAA GAAGGTTAAA
CAACAGCAGC AAAACCACCA GCAGCGCTTT CACGCAGTGG CGATGTCCAA CTACGGCAAG
CCCGGTATCA TGCGTATCTT CGATCATATC TGGCGCTTTG ATACCGGGTT AAAAAGCCGC
TTAATGCGCC GCTGGCGGCG CTGA
 
Protein sequence
MLSLETLDLL LAITEGELIE EMIIGMLAAP QLSIFFKKFP AIRRALDRDL PRWKLQLKER 
LHEAMVPPAL AQEFYRYQQC QLENNTQFFH NLNDTLDLLR QLVSPFYEQA RSLVDAADLP
NHPLDDSFQT LFLQRWRISL TLQATMMHHQ LLEQEREQLM AELQERLALS GALEPLLSEN
DTAAGRLWDM SKGHLQRGDY QRLVEYGNFL QQQPELKKLA QQLGRSYQAK AVQQQDALPE
PFRVMVQVPA TLPEEVSGIH QSDDILRLLP PELVTLGIEE LEFEFYRRLL EKRLLSYRLQ
GDVWQEQIHI RQVTHQQQDQ QPRGPFIVCV DTSGSMGGFN EQCAKAFCLA LLRIALADNR
RCYIMLFANQ IVHYELTAAS GIEQAVRFLG QHFRGGTDLA ACLNATVSKM TESGWFDADA
VIISDFIAQR LPEEIIKKVK QQQQNHQQRF HAVAMSNYGK PGIMRIFDHI WRFDTGLKSR
LMRRWRR
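
The sequences above are printed in space-separated blocks of ten letters, so they need to be joined before any computation. The following is a minimal sketch, with hypothetical helper names, of how the blocks might be rejoined and a GC fraction computed. It is shown only on the first line of the gene sequence; applied to the full gene sequence, the result could be compared with the GC content field of the record.

```python
def join_blocks(lines):
    """Concatenate sequence lines, dropping the spaces between blocks."""
    return "".join("".join(line.split()) for line in lines).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCTCAGCC TGGAAACACT CGATCTGCTG CTGGCGATCA CCGAAGGTGA ACTGATCGAG"
seq = join_blocks([first_line])
print(len(seq), round(gc_fraction(seq), 2))
```

Because join_blocks accepts a list of lines, the same call works unchanged on all lines of the gene sequence or the protein sequence.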