Gene Information |
Locus tag | Spro_4190 |
Symbol | |
ID | 5605129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | - |
Start bp | 4642182 |
End bp | 4643252 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640939750 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001480412 |
Protein GI | 157372423 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGC TGAGCGCGGC AGAGCTACAG CAGAAAATAA AAAATGGCGA GATGATTAAC GCCTGCAACC TCGATGGTCT CGAGCTGCAA GGATACGACC TGTCCGGCGG AATTTTTCAG GATGTCTCAC TGCTGGGGGC GAACCTGCAG GCGGCCAATT TACATGAAGC GGTATTTAAT GAGTGTCTGT TGAATGGGGT GACGCTGAGC GGCGCCCGCA TGCAGCAGAG CGTGTTTAAC GACTGTGAAA TGACGGCAAT CAGCGCCTGC GACACGCTAA TGGCGCAGTG CATCTTTAAT CATTGCGAGC TTGGCAACGG CGATTTCTCC CGCAGCAGGT TCGACGCTTG CCAATTCATG CGCAGCCCGC TATCCGGCAG CACCTTCAAC CAGTCCACGC TGGAACGTAC TACCTTCTTC GAGAGCTCGC TGGATGAGGC GCAACTGGAG CAGTGCCAGT GCCTGCAGAC GACCTTTTTC AATATCGACT TGCGCACGAC CCGTTTGGGC CACAGCCAAT TCGACCGTAC GGTCTTTTTC AACTGCGATC AGCGTGGAAA AAACTATGCC CAGCAGCGCT TTAGCGGCTG TCAGTTTACC GATAATCAGC TCGACGCGGT CGATTTTAAC GGGGCGCAGC TGACGCAGTG CAATTTCAAA GGCGCTTCGC TGAGAAAGGC TCAACTCCGT AAGGTGAACG CCAGCCAGGC GCTGTTTATG AGTGCGGACC TTACCGGAGC CAACTGTCAG GGCAGTTTGT TTGATCAGGC GTTGTTTATC GGTGCCAGGT TACAACAGGC CGATTTTAGC CACAGCCGCC TGTTCCAGAG CATTTTGCAG CAGGTGAAGG CCGAAGACAG CAACTTTGCC CTGTGCGATC TGACCTACAG CGATTTCACC CATGCTGATT TGCGCCGGGC CGATTTCCGC AGTGCCACAT TTTCCCGCAC CCGGTTCCAT CGGGCGCAGC AGGAGGGAGC CCACTTTTCC GATCGCCGGG GCATCCTTGA GTATGACGAA GAGTTACTGG CTGCGGAGGC CTGGAGCCTT CAGCATCAGA GCCGCCGTTA A
|
Protein sequence | MTTLSAAELQ QKIKNGEMIN ACNLDGLELQ GYDLSGGIFQ DVSLLGANLQ AANLHEAVFN ECLLNGVTLS GARMQQSVFN DCEMTAISAC DTLMAQCIFN HCELGNGDFS RSRFDACQFM RSPLSGSTFN QSTLERTTFF ESSLDEAQLE QCQCLQTTFF NIDLRTTRLG HSQFDRTVFF NCDQRGKNYA QQRFSGCQFT DNQLDAVDFN GAQLTQCNFK GASLRKAQLR KVNASQALFM SADLTGANCQ GSLFDQALFI GARLQQADFS HSRLFQSILQ QVKAEDSNFA LCDLTYSDFT HADLRRADFR SATFSRTRFH RAQQEGAHFS DRRGILEYDE ELLAAEAWSL QHQSRR
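
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon not counted in the protein (the listed gene sequence ends in TAA, which fits that assumption). The helper `gc_percent` is a hypothetical name introduced here; its result for the full gene has not been checked against the reported 56%.

```python
# Values copied from the record above.
start_bp = 4642182
end_bp = 4643252
reported_gene_length = 1071    # bp
reported_protein_length = 356  # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon that is not counted in the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)        # coordinate span vs. gene length
print(remainder == 0)                             # length is a whole number of codons
print(protein_length == reported_protein_length)  # codons minus stop vs. protein length


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence; spaces between blocks are ignored."""
    seq = seq.replace(" ", "").upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)
```

Passing the full gene sequence string (with or without the spaces between 10-base blocks) to `gc_percent` gives a value that can be compared with the GC content field.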