Gene Information |
Locus tag | Sare_3556 |
Symbol | |
ID | 5705049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4103687 |
End bp | 4104808 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641272983 |
Product | hypothetical protein |
Protein accession | YP_001538349 |
Protein GI | 159039096 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0984898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0100122 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCAAA AAAATCGGGG GATCGTCGTC GCCGCCACCG GTGACCTGCT CATCGCGCGC GATCGTCCGC ACGACATGTT TCGGCACGTG CGGGACCTGT TGACCGAGGC CGACATCACC GTCGGCCAGT TGGAGACGCC GTACTCCGAC CAGGGCTCCC GGAGTTCATG CGGGGCGCGC GGCGCGGTCC CGAACGATGT GGCGAACTTC GCGGCGATCC CGCACGCGGG CTTCGACGTC ATCTCCCTGG CGAGCAACCA CGCCGGCGAC TGGGGGGCCG ACGCGTTACT CGACTGCATC GAGCGGTGCC GGCGCCACGG CATCACCGTG GTGGGTGCGG GGGCGGACAT CGCCGAGGCG CGCCGGCCGG GGATCATCGA ACGCGATGGG ACCCGGGTCG GATTCCTGGC CTACTGCTCG GTCGCGCCGG ATGGCTACTA CGCCGGGCCG GGTAATCATG GTGTGGCGCC GATGCGGGCG AGAACACTCT ACGAACCGTT CCAGTTCGAC CAGCCCGGAG CTCCGCCCTC GGTCAGAACC CTGCCGGACG AATACGATCT GGCGGCGCTC GTCGCGAACA TCGGCGAGTT GCGCGACCAG GTGGATGTGC TGATCGTGTC GCTGCACTGG GGCCTGCTCT TTCAGCGCTC ACGGCTCGCG GACTACCAGC CGGTGGTGGC GCACGCGGCG ATCGACGCCG GCGCCGACGT GGTGATCGGG CACCACCCGC ACATCCTGAA GCCGGTGGAG GTCTACCGAG GCAAGGTCAT CTTCTACAGC CTGGGCGACT TCGCCCTCGA GATCAACGAG CGCTGGTGGC GGTCGTTCAG CCGGGAGTGG TTCGAGCGGG CGGTCCAGTT CTATCAGGCA CTCGCCCCCG GCCAGGATAT GCACGAGGAG GGCCGGAACT CGATGATCGT CCAGCTGCAC ATCGTCGACG GCCGCATCGA CCGGGTTGGC TTCGTACCCG TGACGATCAA CGATGCACGC GAGCCGGTGC CGTACCGGGC GGACACAGAG GACGGGCGCG CGGTCCGCGC CTACCTGGCG CAGATCACGG CCGAGGCGGG GATCGACACC ACCTTCGACG TGGTCGACGA CGAGGTCCTG GTCCGTATCT GA
|
Protein sequence | MGQKNRGIVV AATGDLLIAR DRPHDMFRHV RDLLTEADIT VGQLETPYSD QGSRSSCGAR GAVPNDVANF AAIPHAGFDV ISLASNHAGD WGADALLDCI ERCRRHGITV VGAGADIAEA RRPGIIERDG TRVGFLAYCS VAPDGYYAGP GNHGVAPMRA RTLYEPFQFD QPGAPPSVRT LPDEYDLAAL VANIGELRDQ VDVLIVSLHW GLLFQRSRLA DYQPVVAHAA IDAGADVVIG HHPHILKPVE VYRGKVIFYS LGDFALEINE RWWRSFSREW FERAVQFYQA LAPGQDMHEE GRNSMIVQLH IVDGRIDRVG FVPVTINDAR EPVPYRADTE DGRAVRAYLA QITAEAGIDT TFDVVDDEVL VRI
|
| |
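The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source database: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS whose length includes the terminal stop codon (the gene sequence above ends in TGA). Both assumptions are consistent with the values listed here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the stop codon is
    included in the gene length and is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces and any other
    non-ACGT characters are ignored (the record groups bases in tens)."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record for Sare_3556
length = gene_length(4103687, 4104808)
print(length)                  # 1122
print(protein_length(length))  # 373
```

Passing the full gene sequence string to gc_percent should give a value close to the 68% GC content reported above, although the database may round differently.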