Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_5078 |
Symbol | |
ID | 5704213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5752445 |
End bp | 5755384 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641274470 |
Product | glycosyl transferase family protein |
Protein accession | YP_001539811 |
Protein GI | 159040558 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0744] Membrane carboxypeptidase (penicillin-binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000111802 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACTCGT ACGGCGACCC CAGTTCCCCG CGCGGGCGGG CCCGGATACC GGGCCAGCAC GACCCCGGGC TGGCCGACGA CGCGTACCGT GCCCCGAATG ACGAGGTGCG TGGCCGGGCG GCCGCGCCGG AGGCATCGGC GGGCCGTGCC TCGGTAACAC CCGGCGGCGG TGCCTCCAGC GGGCGGGCCT CGGTTGGCGG CTCGGCGTCC GTCCCCTCCC CGGCTGCCGC CGGCACGGCC TCGGTCGGTC GCGCCTCGGC GTCGGTATCG GCTGCGCCCG GCCGTGCATC TGTGCCGGCA CCGGCCGCGC CCGGTCGCGC CTCCGTGCCG GCACCGTCGA CGCCCGGTCG CGCATCGGTG CCGACATCCC CGGCGCCGGG TCGTGCCTCC GTGCCGGTCT CTCCGGCGCC GGGCGGCTCT ACTGGGCGCG CGACCGTCGG TGCGGCCTCC GTGGGAACGG CCTCGGCCGG CCGGGCTCGG GTCGGCACGG CGGCGGTCGA GGGGCGGGCC TCCGTGGCGC GGGCCAGCGT CGGACCGGCG TCCGCCGGTC CTGGTGGTCC GGGCGGCCCG AGTGGTCCGG GCAGGTCCAG GTCCGGCGGC CGTGATCCGA ACGCCGCCGC GCGGGCGAAG AAGCGGAAGC GGGCGAACAT GCTGATCGCC GCGTGCGCGG TCTTCATCAT GCTCGCCGGC GTGGGTGTAG TCGGCTTCGC CTATTACTCC ACCAACGTCG TTCTACCCAA CCAGATTCCA CTTCCGCAGT CGACGACGGT CTACACGAGT GACGGCAAGG GGCTGGTCGC CAAGCTTGGC AACGAGAACC GGACGCTCAT TACCGTCAGC CAGATGCCGC AGCACGTTCG CCACGCGGTG GCCGCTGCCG AGGACCGTAA CTTCTACCGG CACTCCGGCG TCGACTACAA GGGCATCGCC CGAGCAGCGT GGAACAACTT CACCGGTGGC CACCGACAGG GCGCGTCGAC GATCACGCAG CAGTACGCGC GTGGCGCCTA CGAGAGCCTC GAGGACGACA CCTACACCCG GAAGGTGCGG GAGGCGATCT TCGCCTCGAA GCTAAACGAC GATTTCAGCA AGGAAAAGAT CATGGAGAAC TATCTCAACG TGATCTATTT CGGACGCGGA GCGTACGGGA TCGAGGCCGC GGCGCAGACC TTCTTCGGCA AGGCCGCCAG CAAGTTGACC GTCGGCGAGG GCGCGGTGCT GGCTGGGATC ATCAAGCAGC CGGAGCCCTC CGCCACCCAC AACGGGTTCG ACCCGGCGAC CGCCCCGGAC GACGCGAAGT CGCGATGGGA CTACGTTCTC GACGGGATGG TGGCCGAGGG CTGGCTCGAC GCGGCGGAGC GGCCGACCGA ATATCCGAAG GTGAAGCCGC CGGCCGAGGG CGGCAACGGC TTCGGTGTGG CGACCCCACG CGGCAACGTC ATCAACTATG TGCGGGCGGA AATGGAGCAG TGGGGACTCT GCACCAACAC GGGTGCCGAC GAGGTCAAGC CCTCCTGTGC GGATGAGCTA CGCAAGGGCG GCTACAAGAT CACCACAACG ATCGACGACA AGATGCAGAC CGCTCTGGAG AAGGCGGCAC GAGCGGGGGT AAAGGGTTCG GTCCTCGACG GCCAGCCGGA CAATCTGATG GCTGCCGGAG TCGCGATAGA CCCCAAGACG GGCCGGGTGC TCGCCTACTA CGGTGGGGAA AGCGGTGGTG ACATCGACTT GGCCGGCAAG AACACCACAG ACGGCATCCT CTATGGTGGC CATCCCCCTG CCTCGTCCTT CAAGGTCTAC ACTCTTGCGG CCGCCATCGA GGCCGGCATC TCCGTCAACT CCCGGTGGGA CGCGACGCCC TTCACCCCCG AGGGGTACGA CGACAAGATC CATAACGCGA GCCGGAACGC GCACTGTGGT AAGTCCTGCA CCCTGGACGA GTCGACGGTC AAGTCGTACA ACGTTCCGTT CTTCCATGTG GCGGAGCAGA TCGGCCCGGA CAAGGTGGTC AGCATGGCCC GGCAGGCGGG TATCACCACC ATGTGGAACA ACGACGACCC CGCCACGCCG TTCAACCTGT TGGCGGAGAA GCCGGAGGAC CTGGCACCGA AGCAGTTCGA CCACGTGGTG GGCTATGGCG CGTACCCGGT CACCGTCCTC GACCATGCCA ACGGTCTCGC GACGCTGGCG AACGACGGTC TATACCATAA GGCGCATTTC GTGCTCAAGG TCGAACAGAA GGACAAAACG ACCGGCGAGT GGAAGGTCGT CCGTGGGACC GGCGAGAAGC TGGACGGGCA GCAGCGGATC CGGAAAGGGG TCACCGACGA GGTGACCGCG GTGCTCAAGC AGATCCCGAG TGAGAACGGC GATGCCCTGT CCGGTCGTCG GCAAGCGGCC GGGAAGACCG GCACCTGGGA ACTCATCGGG ACCCCCCACA ACTCGAATGC CTGGATGGTC GGCTACGACG ACAACCTGGC GACAGCGATA TGGATCGGCG CTAACCCCGA GGCAGAAAGT AAAGCGATTC TCACCAAGAA CAAGAAGAAC ATCGGCGGTA GCGGTCTCCC GGCGGACCTG TGGAAGCGGT TCATGGACGA CGCGCTCAAC GGTAAGCCCA AGTCCGACCT GCCGCGCATC ACCGGAGTCG GCGACGACAC GGTCGGCAAC GGCGAGCAGC CGAAACCGGA GCCGCCGGAC TGCGGGTGGC TCGGCGGCCT GTTCTGCCCG GACGACGACG ACGATGATGA CGACAACGGC GGTGGTGGCG ACAACGGCGG TGGCGGTAAC AACGGCGGTG GCGGTAACAA CGGCGGTGGC GGTAACAACG GCGGTGGCGG TAACAACGGC GGTGGCGGTA ACAACGGCGG TGGCGGCGGT GGAGACATCG GGTTCCCGCC GCCGCCAACC GGAAACACCG AACGACCCAG GCGGGACTAG
|
Protein sequence | MNSYGDPSSP RGRARIPGQH DPGLADDAYR APNDEVRGRA AAPEASAGRA SVTPGGGASS GRASVGGSAS VPSPAAAGTA SVGRASASVS AAPGRASVPA PAAPGRASVP APSTPGRASV PTSPAPGRAS VPVSPAPGGS TGRATVGAAS VGTASAGRAR VGTAAVEGRA SVARASVGPA SAGPGGPGGP SGPGRSRSGG RDPNAAARAK KRKRANMLIA ACAVFIMLAG VGVVGFAYYS TNVVLPNQIP LPQSTTVYTS DGKGLVAKLG NENRTLITVS QMPQHVRHAV AAAEDRNFYR HSGVDYKGIA RAAWNNFTGG HRQGASTITQ QYARGAYESL EDDTYTRKVR EAIFASKLND DFSKEKIMEN YLNVIYFGRG AYGIEAAAQT FFGKAASKLT VGEGAVLAGI IKQPEPSATH NGFDPATAPD DAKSRWDYVL DGMVAEGWLD AAERPTEYPK VKPPAEGGNG FGVATPRGNV INYVRAEMEQ WGLCTNTGAD EVKPSCADEL RKGGYKITTT IDDKMQTALE KAARAGVKGS VLDGQPDNLM AAGVAIDPKT GRVLAYYGGE SGGDIDLAGK NTTDGILYGG HPPASSFKVY TLAAAIEAGI SVNSRWDATP FTPEGYDDKI HNASRNAHCG KSCTLDESTV KSYNVPFFHV AEQIGPDKVV SMARQAGITT MWNNDDPATP FNLLAEKPED LAPKQFDHVV GYGAYPVTVL DHANGLATLA NDGLYHKAHF VLKVEQKDKT TGEWKVVRGT GEKLDGQQRI RKGVTDEVTA VLKQIPSENG DALSGRRQAA GKTGTWELIG TPHNSNAWMV GYDDNLATAI WIGANPEAES KAILTKNKKN IGGSGLPADL WKRFMDDALN GKPKSDLPRI TGVGDDTVGN GEQPKPEPPD CGWLGGLFCP DDDDDDDDNG GGGDNGGGGN NGGGGNNGGG GNNGGGGNNG GGGNNGGGGG GDIGFPPPPT GNTERPRRD
|
| |