Gene Spro_4627 details


Gene Information

Locus tag: Spro_4627
Symbol:
ID: 5604451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 5103204
End bp: 5105537
Gene Length: 2334 bp
Protein Length: 777 aa
Translation table: 11
GC content: 60%
IMG OID: 640940193
Product: RNA-binding S1 domain-containing protein
Protein accession: YP_001480848
Protein GI: 157372859
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
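
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
gene_len = gene_length_bp(5103204, 5105537)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the result agrees with the reported Gene Length (2334 bp) and Protein Length (777 aa).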


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGACC CACTGAGCCG CATTATTGCA ACAGAACTGC AGGCCCGGCC GGAGCAAGTT 
GACTCCGCCA TCCGTCTGCT GGATGAAGGT AATACCGTGC CCTTTATTGC ACGCTATCGT
AAGGAAGTCA CCGGGGGTCT GGACGATACC CAACTGCGCC AGCTGGAAAC CCGCCTGGGT
TATCTGCGTG AACTGGAAGA CCGCCGTCAG ACCATCCTTA AATCAATCGA CGAGCAGGGC
AAACTGACCG AACAGCTGGC GGGGGCGATC ACCGCCACGC AAAGCAAAAC CGAACTTGAA
GATCTCTACC TGCCGTACAA ACCCAAGCGC CGCACCCGTG GGCAGATCGC GATTGAAGCC
GGTCTGGAGC CCCTGGCAGA CACCCTGTGG CAGGATCCTC AGCAGCAGCC TGAACAACTG
GCCGAAGGCT ACGTTGATGC CGACAAGGGC GTAGCGGACG TTAAAGCCGC CCTCGACGGC
GCGCGTTACA TTCTGATGGA GCGCTTTGCC GAAGACGCCG CGCTGCTGGC CAAGGTTCGT
AATTACCTGT GGAAGCACGC GCATCTGGTC TCCAAAGTGG TGGAAGGCAA AGAAGAAGCC
GGCGCGAAAT TCCGCGACTA CTTCGATCAC CACGAACCTA TTGCCCAGGT GCCTTCACAC
CGTGCGCTGG CCATGTTCCG TGGCCGCAAC GAAGGCGTGC TGCAACTGGC GCTGAACGCC
GACCCACAGT TTGAAGAAGC CCCGCGCGAA AGCCAGGCGG AACAGATCAT CATCAGCCAT
CTCGATTTGC GCCTGAATAA CGCCCCGGCA GATGCCTGGC GCAAAGCGGT GGTCAACTGG
ACCTGGCGCA TCAAGGTGTT GCTGCATCTG GAAACCGAAC TGATGAGCAC CCTGCGCGAA
CGTGCGGAAG ATGAAGCAAT CAACGTCTTC GCCCGTAACA TGCACGATTT GCTGATGGCC
GCACCGGCCG GCATGCGTGC GACCATGGGG CTGGATCCGG GCTTGCGTAC CGGGGTGAAA
GTGGCGGTGG TGGATGCCAC CGGCAAGCTG GTCGCCACCG ACACCGTCTA CCCGCACACC
GGCCAGGCCG CCAAAGCCGC GGCCATCGTC GCGGCACTGT GCATCAAACA CAAAGTGGAA
CTGGTCGCCA TCGGCAACGG TACCGCATCG CGTGAGACCG AGCGTTTTTA TCTTGATTTG
CAACAGCAAT TCGGCGAAGT CAAAGCGCAG AAAGTGATCG TCAGCGAAGC CGGTGCCTCG
GTGTATTCCG CCTCCGAACT GGCGGCGCTG GAGTTCCCGA ATCTCGACGT CTCGCTGCGT
GGCGCCGTCT CCATCGCCCG TCGCCTGCAG GATCCACTGG CCGAACTGGT CAAAATCGAT
CCGAAATCCA TCGGTGTTGG TCAGTACCAG CACGATGTCA GCCAAAGCCA ACTGGCGAAA
AAACTCGATT CGGTGGTTGA AGACTGCGTA AACGCCGTCG GGGTTGATCT GAACACCGCT
TCGGTGCCGC TGCTGACCCG CGTGGCCGGC CTGACCCGCA TGATGGCGCA GAACATCGTT
ACCTGGCGTG ATGAGAATGG CCGTTTCAGC AACCGCGAAC AGCTGTTGAA AGTCAGCCGC
CTGGGGCCAA AAGCCTTTGA GCAGTGCGCT GGCTTCCTGC GTATCAACCA CGGCGACAAC
CCGCTGGACG CCTCGACCGT TCACCCGGAA ACCTACCCGG TGGTCGAACG CATTCTGGCC
GCCACCCGCC AGGCACTGCA AGATCTGATG GGTAACCCGG CAGCGGTACG CAGCCTGAAG
GCCAGCGATT TCACCGACGA CAAGTTCGGC GTGCCAACGG TGACCGACAT CCTGAAAGAG
CTGGAGAAAC CGGGCCGCGA TCCGCGTCCG GAATTCAAAA CCGCCACCTT CGCCGAGGGC
GTTGAAACCC TGAGCGACCT GCAGCCGGGG ATGATTTTGG AAGGTTCAGT GACCAACGTC
ACCAACTTCG GAGCCTTTGT CGATATCGGC GTGCATCAGG ACGGTCTGGT GCATATCTCC
TCGTTGGCGG ACAAGTTTGT CGAAGATCCG CACACCGTGG TGAAAGCCGG TGACATCGTC
AAAGTGAAGG TGATGGAAGT GGATCTGCAG CGCAAACGCA TCGCGCTGAG CATGCGTCTG
GACGAGCAAC CGGGTGAAGG TTCACCACGC CGCGGCGGTA ACGCCGCCCC GGCCAGGGAC
AACGCCAACC GGGCACCGGT CAATAAGGGC AAACCGCGCG GCAACAACAA CACCTCGGCG
GGTAACAGCG CCATGGGTGA CGCGCTGGCG GCGGCATTCG GCAAAAAATC TTAA
 
Protein sequence
MNDPLSRIIA TELQARPEQV DSAIRLLDEG NTVPFIARYR KEVTGGLDDT QLRQLETRLG 
YLRELEDRRQ TILKSIDEQG KLTEQLAGAI TATQSKTELE DLYLPYKPKR RTRGQIAIEA
GLEPLADTLW QDPQQQPEQL AEGYVDADKG VADVKAALDG ARYILMERFA EDAALLAKVR
NYLWKHAHLV SKVVEGKEEA GAKFRDYFDH HEPIAQVPSH RALAMFRGRN EGVLQLALNA
DPQFEEAPRE SQAEQIIISH LDLRLNNAPA DAWRKAVVNW TWRIKVLLHL ETELMSTLRE
RAEDEAINVF ARNMHDLLMA APAGMRATMG LDPGLRTGVK VAVVDATGKL VATDTVYPHT
GQAAKAAAIV AALCIKHKVE LVAIGNGTAS RETERFYLDL QQQFGEVKAQ KVIVSEAGAS
VYSASELAAL EFPNLDVSLR GAVSIARRLQ DPLAELVKID PKSIGVGQYQ HDVSQSQLAK
KLDSVVEDCV NAVGVDLNTA SVPLLTRVAG LTRMMAQNIV TWRDENGRFS NREQLLKVSR
LGPKAFEQCA GFLRINHGDN PLDASTVHPE TYPVVERILA ATRQALQDLM GNPAAVRSLK
ASDFTDDKFG VPTVTDILKE LEKPGRDPRP EFKTATFAEG VETLSDLQPG MILEGSVTNV
TNFGAFVDIG VHQDGLVHIS SLADKFVEDP HTVVKAGDIV KVKVMEVDLQ RKRIALSMRL
DEQPGEGSPR RGGNAAPARD NANRAPVNKG KPRGNNNTSA GNSAMGDALA AAFGKKS
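
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below is an illustration, not the tool that produced this record. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. It also assumes the input contains only A, C, G and T plus whitespace. The function names are hypothetical.

```python
# Codon assignments ordered by base T, C, A, G at each codon position.
# NCBI translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First seven codons of the gene sequence in this record.
print(translate("ATGAATGACC CACTGAGCCG C"))
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the reported GC content. This sketch does not handle alternative start codons such as GTG, which are read as methionine at initiation. That does not matter here because the gene begins with ATG.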