Gene Spro_4190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4190 
Symbol 
ID5605129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4642182 
End bp4643252 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content56% 
IMG OID640939750 
Productpentapeptide repeat-containing protein 
Protein accessionYP_001480412 
Protein GI157372423 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGC TGAGCGCGGC AGAGCTACAG CAGAAAATAA AAAATGGCGA GATGATTAAC 
GCCTGCAACC TCGATGGTCT CGAGCTGCAA GGATACGACC TGTCCGGCGG AATTTTTCAG
GATGTCTCAC TGCTGGGGGC GAACCTGCAG GCGGCCAATT TACATGAAGC GGTATTTAAT
GAGTGTCTGT TGAATGGGGT GACGCTGAGC GGCGCCCGCA TGCAGCAGAG CGTGTTTAAC
GACTGTGAAA TGACGGCAAT CAGCGCCTGC GACACGCTAA TGGCGCAGTG CATCTTTAAT
CATTGCGAGC TTGGCAACGG CGATTTCTCC CGCAGCAGGT TCGACGCTTG CCAATTCATG
CGCAGCCCGC TATCCGGCAG CACCTTCAAC CAGTCCACGC TGGAACGTAC TACCTTCTTC
GAGAGCTCGC TGGATGAGGC GCAACTGGAG CAGTGCCAGT GCCTGCAGAC GACCTTTTTC
AATATCGACT TGCGCACGAC CCGTTTGGGC CACAGCCAAT TCGACCGTAC GGTCTTTTTC
AACTGCGATC AGCGTGGAAA AAACTATGCC CAGCAGCGCT TTAGCGGCTG TCAGTTTACC
GATAATCAGC TCGACGCGGT CGATTTTAAC GGGGCGCAGC TGACGCAGTG CAATTTCAAA
GGCGCTTCGC TGAGAAAGGC TCAACTCCGT AAGGTGAACG CCAGCCAGGC GCTGTTTATG
AGTGCGGACC TTACCGGAGC CAACTGTCAG GGCAGTTTGT TTGATCAGGC GTTGTTTATC
GGTGCCAGGT TACAACAGGC CGATTTTAGC CACAGCCGCC TGTTCCAGAG CATTTTGCAG
CAGGTGAAGG CCGAAGACAG CAACTTTGCC CTGTGCGATC TGACCTACAG CGATTTCACC
CATGCTGATT TGCGCCGGGC CGATTTCCGC AGTGCCACAT TTTCCCGCAC CCGGTTCCAT
CGGGCGCAGC AGGAGGGAGC CCACTTTTCC GATCGCCGGG GCATCCTTGA GTATGACGAA
GAGTTACTGG CTGCGGAGGC CTGGAGCCTT CAGCATCAGA GCCGCCGTTA A
 
Protein sequence
MTTLSAAELQ QKIKNGEMIN ACNLDGLELQ GYDLSGGIFQ DVSLLGANLQ AANLHEAVFN 
ECLLNGVTLS GARMQQSVFN DCEMTAISAC DTLMAQCIFN HCELGNGDFS RSRFDACQFM
RSPLSGSTFN QSTLERTTFF ESSLDEAQLE QCQCLQTTFF NIDLRTTRLG HSQFDRTVFF
NCDQRGKNYA QQRFSGCQFT DNQLDAVDFN GAQLTQCNFK GASLRKAQLR KVNASQALFM
SADLTGANCQ GSLFDQALFI GARLQQADFS HSRLFQSILQ QVKAEDSNFA LCDLTYSDFT
HADLRRADFR SATFSRTRFH RAQQEGAHFS DRRGILEYDE ELLAAEAWSL QHQSRR