Gene Spro_2014 details


Gene Information

Locus tag: Spro_2014
Symbol:
ID: 5607450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 2207066
End bp: 2208346
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 61%
IMG OID: 640937552
Product: gluconate 2-dehydrogenase (acceptor)
Protein accession: YP_001478245
Protein GI: 157370256
COG category: [C] Energy production and conversion
COG ID: [COG2010] Cytochrome c, mono- and diheme variants
TIGRFAM ID:
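
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Encoded residues, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2207066, 2208346)
print(length)                  # 1281
print(protein_length(length))  # 426
```

Under these assumptions the computed values agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields.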


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.684291
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0791101
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGAC TTACCCCCTC GCTGCTGGCG CTGTTGATGC TGGGTGCCAC CACGGTGCAG 
GCGAACGACG ATCCGCAACT GGCGGCCCGG CTGCAACACG GGGAATACCT GGCGCGCGCC
GGTGACTGCG CCGCCTGCCA TACCGCCCCC GGCGGCCAGC CCTTTTCCGG CGGTCTGAAA
ATGACCACCC CGGTCGGCGC TATCTATTCC ACCAACATCA CGCCGGACAA GCAAACCGGC
ATTGGCGAAT ACAGCCTGCA GGAATTCAGT GACGCGTTAC GCAAAGGCGT TGCTCGCGAC
GGCACCCGGC TCTACCCCGC GATGCCCTAT CCTTCGTTCG CCAAAATCAG TGATGGAGAC
ATACGCGACC TCTACCTGTA TTTCACCCGA CAGGTTAAAC CCGTGGTTCA GCAAAATAAG
GACAGCAGTA TCCCCTGGCC GCTGAGTATC CGCTGGCCGC TGGCCCTGTG GAATCTGGCC
TTTCGGGAGG ACGGTACCTA TCGGCCGGAC GTGACAAAAA GCCTCGACTG GAATCGCGGC
GCCTATCTGG TTCAGGGGCT GGGCCACTGC GGCTCCTGCC ACACGCCGCG CGGTATCGGC
TTTCAGGAAA AAGCGCTCAG CCAGAGCGAC GACGCCTATC TGAGTGGTGG CACGCTGGAA
GGCTGGCATG CTGCCAACCT GCGGGCGGAC GCCGTGAGCG GCCTCGGCCG CTGGAGCGCG
GAAGATATTA CCCGTTTCCT GAAGACCGGC CATAATCGGC GATTTGCCGC TTTCGGCTCA
ATGATTGACG TGGTACAAGA TAGCACTCAG CACCTGAGCG ACGCCGATTT ACGCGCCATC
GCCGGTTATC TGCAATCATT GCCGACCGTC AATAAGGAAA AGCCGCTGGA GCTGGATGGA
GGCACGGGCA AAATGCTGCT AAACGGTAAC GTCAGTCAGC CGGGTGCCCA GACCTATCTG
GATAACTGCG CCGCCTGCCA CCGCAGCGAC GGTCAGGGAT ATCGCGATAC CTTCCCGCAG
TTGGCGCTCA ACCCGGCGCT GTTAAGCGAC GATCCGTCCT CGCTAATCAG CATTATCCTC
AAGGGTTCCC GCACGCCGGT GACCGTCGGC GCGCCAACCG GGCTGACCAT GCCGGACTTT
GGCTGGCGCC TTGACGATGA ACAAATTGCT CAGCTCGCGA CCTTTATTCG CCACAGTTGG
GGCAATGATG CATCGGCGGT GACCGCCGCT CAGGTGCAGG ATATCCGCAA AAACAGCGTC
ACCAAACCAG AGCATCCTTA A
 
Protein sequence
MKGLTPSLLA LLMLGATTVQ ANDDPQLAAR LQHGEYLARA GDCAACHTAP GGQPFSGGLK 
MTTPVGAIYS TNITPDKQTG IGEYSLQEFS DALRKGVARD GTRLYPAMPY PSFAKISDGD
IRDLYLYFTR QVKPVVQQNK DSSIPWPLSI RWPLALWNLA FREDGTYRPD VTKSLDWNRG
AYLVQGLGHC GSCHTPRGIG FQEKALSQSD DAYLSGGTLE GWHAANLRAD AVSGLGRWSA
EDITRFLKTG HNRRFAAFGS MIDVVQDSTQ HLSDADLRAI AGYLQSLPTV NKEKPLELDG
GTGKMLLNGN VSQPGAQTYL DNCAACHRSD GQGYRDTFPQ LALNPALLSD DPSSLISIIL
KGSRTPVTVG APTGLTMPDF GWRLDDEQIA QLATFIRHSW GNDASAVTAA QVQDIRKNSV
TKPEHP
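
The relationship between the gene sequence, translation table 11, and the protein sequence can be checked programmatically. The following is a minimal sketch, not the database's own method: it assumes the sequences are pasted as the space-separated 10-character blocks shown above, and it uses the codon assignments of NCBI translation table 11, which match the standard code for elongation. All function and variable names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence from this record (60 nt).
first_line = "ATGAAAGGAC TTACCCCCTC GCTGCTGGCG CTGTTGATGC TGGGTGCCAC CACGGTGCAG"
print(translate(first_line))  # MKGLTPSLLALLMLGATTVQ
```

The translated residues match the first 20 residues of the protein sequence listed above. Passing the complete gene sequence to `gc_content` gives a value to compare against the GC content field.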