Gene Spro_1980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_1980 
Symbol 
ID5603337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2166554 
End bp2167957 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content50% 
IMG OID640937518 
Producthypothetical protein 
Protein accessionYP_001478211 
Protein GI157370222 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00207336 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCACGCCT ACGAACTTGG CCGAAAACTG GCAGCCCTAT CGGACCGGGC CAGAGCGGCC 
CTCTCTGGAA TTGAAATCAA TCAGGCAGAT GCTCGCCACC TGCAACTGGT GCTGAAACAG
CAAATCGCCA GCCTCTATCA CCAGATGGCC ACAGTACTTC TGAAAGAATC TCCTCGCGTT
GGCGAAATGG CAAACGTGCA GAAATTACTA AATAAACTTC AGCAAGAGAT TGCCCAGTCT
CAACAGGCCT TACAGCTCAG CGAGAAAGAA ATCGAGAACC GACATGCTCA GTTGCAACAG
TTGCATGTAG AAATAAATCG GCAAGAAGAT GAACGAGATC GGCAACTACA ACAAAATATT
GATGCTGTAG CAGCCAGGCA GTTGATGGTT AAAAGCAGTC AGGATGTCGG CCAACAGGAA
AACAGCCATA AGGAATTAAT GGCTGAAACG ACCACCAAGC TAGGGGAGTA CCATCAGCAA
CGCATTTTCA TGTTCCTGGA GAGAAAAGGC TATGGGCAAC CGGGATATTC AGCCTGGCCG
CTGTCGCGCA ATCTCGATAG CTGGCTGGCA CGAATTAGCC ACTATCCGAA CAACGCAGCC
AATTACCGGA TGCTCCTGGC ACTGCAAAAG GAATCTACTC GGCGTCTGCT GGAGTTGCAC
AATCAAACCA AGCAACAGAC CAATGAATAT CAACAATATG TACAAGACGT CGAACAGAAT
CTACAGCTGC CCACGCTGTA TCTGAATCTC AACGCGCTAG AAAAAATACT GAGTTCCACA
CAACAAGAAA TCCGCAACCA GCAAAATAAG CTGCAGGAAT ACGCTCAGGG CGAAGGTGAG
ACTTACAGCC AGATAACAAC ACAGCTCTCG GCGCAAATGG CCTTATTGCC TCTGGATAAA
TTGGATATTC TGGTGGCGAA AACCGAGACG CCGACTGACG ATCGCCTGTT GGAAGAGCTG
AGAGAACTAC AGCTTGAAGA GATAGCGGTC GAAGAGAACC TGCTACAGCA AGAGATCGAC
GCTCAATTAG CCCAGCGCCG TGCCACTGCC GCCATCGAAT TGAACAATCA ATTTTCGGCG
CAGGGATATG ACAATCCGAT TTATGAGTAC CAATGGAGAT GGTCAGATAA ACCGGAGGAA
CTGTTTGAAA ACTACCTGTC CGGTGCCATC AGTCTGAAAG CGGTATTGCA TAAGTTGGAT
ATGATTACCC ACCGACTCCC ACCACCATCG GTAACTGCCA GAGCCTCGTC CGGTTCGTGG
GGAAATTCAT CCGCACGGAG CTATTCGTCT GGCGTCGGAT ATAGTTCTTC ATCTTCATCG
TCCTCTTCCG GCAGCGGTGG TAGTGGCTTC AGTTCTTCTT CTTCAACCGG CGGTGGCGGG
TTCCGCACTA CGGATAGTTT CTAA
 
Protein sequence
MHAYELGRKL AALSDRARAA LSGIEINQAD ARHLQLVLKQ QIASLYHQMA TVLLKESPRV 
GEMANVQKLL NKLQQEIAQS QQALQLSEKE IENRHAQLQQ LHVEINRQED ERDRQLQQNI
DAVAARQLMV KSSQDVGQQE NSHKELMAET TTKLGEYHQQ RIFMFLERKG YGQPGYSAWP
LSRNLDSWLA RISHYPNNAA NYRMLLALQK ESTRRLLELH NQTKQQTNEY QQYVQDVEQN
LQLPTLYLNL NALEKILSST QQEIRNQQNK LQEYAQGEGE TYSQITTQLS AQMALLPLDK
LDILVAKTET PTDDRLLEEL RELQLEEIAV EENLLQQEID AQLAQRRATA AIELNNQFSA
QGYDNPIYEY QWRWSDKPEE LFENYLSGAI SLKAVLHKLD MITHRLPPPS VTARASSGSW
GNSSARSYSS GVGYSSSSSS SSSGSGGSGF SSSSSTGGGG FRTTDSF