Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_5070 |
Symbol | |
ID | 5704590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5739296 |
End bp | 5741431 |
Gene Length | 2136 bp |
Protein Length | 711 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641274463 |
Product | hypothetical protein |
Protein accession | YP_001539804 |
Protein GI | 159040551 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000565687 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCACCG GACAGGCTGG CGGCGCCCAG CCACCACACC CCGAGGCGGC CAGGGTCACT GACCCTGCCG CAACGCCGAC CAGCCGCCGG GCCGACAAGG ACAAACCCGA GACCACCACG AGCAAGCACG AGACCGACAC AAGCAAGACC AAGACCGACA CAAGCAAGAC CAAGACCGAC ACAAGCAAGA CCAAGACCGA CACAAGCAAG ACCAAGACCG ACACAAGCAA GACCGACACC GACACCGACA CCGACACCGA CACGGATAAG CCCGAGAACA GCACGGGTGA CGGCGCGTCT CCCACAGGAG ACGGTTCCCC GGCTGACAAG GACGCCACCG ATACGGCCGA AAAGGATGAA CGGGATCCGT TCACCTCCTT CGCTCCCGCT CCGGAGCCGG TCCCGACCCG CGCCGGCCGG GTAGCCCGCG CTGTCGGACG GCTCCTGGCC CACGAGTGGA CGCTCGCGGT ACTCGGCTCA CTGGCCCTGG CCGTCGCGAT GACCTGGCCG ACGCTGCGCT ACCCGCGCTA CACCCTGCCC CAGGACTACT GGGATCCGAG CCTGCAGGCA TGGCAGATGG CCTGGACCGG GCACGCGCTG TTGACCGATC CGGGTCGGCT GTTGCACTCC AACGCGTTCT TCCCGGAACG GTGGAGCTTC GTCTTCTCCG ACACGTTGCT GGGGTACGCC CCCGCGGGGA TGATCGGTTC CGGCCCCGAA GCCGCCCTGC TCCGCTACAA CATCATGTTC GTACTGGCCC ACGCCCTCGC CACGTTCGGG GCGTACGCGC TGGCCCGACA GTTGGGGGCG GGCCGCATCG GCGCCGCGGT CGCCGGGGTG ACGTACACGT ACGCACCCTG GCTGTTGGCC CAGGCCGGGC ACCTGCACGT CGTCTCCAAC GGCGGTATAC CGCTGGCGCT GGCCATGCTC GCCCGAGGAC ACGGCTGGTC GCTGCGGCAC GGCTACCGCC CGGAGAAACG GCGGGTCCGG TGGGTGTACG CCGGGTGGCT GGTCGCCGCC TGGCAGCTGA GCCTCGGGTT CGGCATCGGG CTGCCGTTCG CGTACGTCCT CGCCGGGATC GGGCTGGTCG CCACGATCAC CTGGGTGCTG CGACGCTGGG TCGTCCGCCC GGTTCGGCGC CCCTTCGGTC GACGTCTGTT CGCCGCCGAC GCGGTGGGCG GGCTGATCTT CGCCGCGGTC GGAGGGCTAC TGGCCCTGCC CTACTTCACG GTCGCCAAGC TGCACCCGCA GGCGGAGCGA ACCCTTGACG ACCTCGCCTG GTACTCCCCG CCCGCGACCG GCTTCCTCAC CGCACCAGCC GAGTCGCGGG TCTGGGGTGA CCTGCACGAG GGGGCCCGGT CGACATTGCC ATGGCACCCG GAGATGACGT TGCTGCCCGG CTATGTCCTC TACGCCCTCG CGTTGGGCGG CCTCTTCTAT TCGGTCTGGC GAGTCCGGCA CCGGCTCCTG CTGCTGGCCG GGGTCGTGGT CAGCATGGTG TTGGCGATGG GGACCGAGTT CTTCGACGGA CGCTTCACCC TCGTGCCGCT CTTCGAGCAC GTACCCGGGT GGAACGGCCT GCGTACGCCG GGCCGGCTGA TGCTCTGGGC GACGCTGCTG CTCGGTCTGC TCGCGGCCGG CTCGGTGAGT GCCTTCGCCA GCCGGGTCCG AAAGATCTCC GCCGCGCGGA TCCCTTCCTG GCCGAACCCG TGGCTGCGGT TGGCCACGCT GCTTCCACTG CTGTTGGTGA CGGTGGAAGG GCTCAACCGC ACCCCACACC CTGTGGTTCC GGTGCAGCCC GCGGCGATGC GCACGGTGGA TGGCCCGCTA CTGGTGCTGC CCAGCGACCA GAGCCATGAC CAGCCGGTCA TGCTCTGGTC GACCACCCAC TTCCAGGACA TCGTCAACGG CGGCAGTGGC TTCACCCCGA CCCTGCTCGA GGACGTCCGT GAGGTGACGA CCGCGTTCCC CGACCAGGCG AGCGTCGAGT ACCTCCGCAC CCTGGGCGTT CGCAACGTGC TGATCCCCCG TGATCTGGTG GCCGGCACGC CCTGGGAGAT CAGCATCGAC GCGCCGGTCG ACGGGCTGGG CATCACCCGG GAGGAGATCG GCAACGTGGT CGTCTACCGT CTGTGA
|
Protein sequence | MTTGQAGGAQ PPHPEAARVT DPAATPTSRR ADKDKPETTT SKHETDTSKT KTDTSKTKTD TSKTKTDTSK TKTDTSKTDT DTDTDTDTDK PENSTGDGAS PTGDGSPADK DATDTAEKDE RDPFTSFAPA PEPVPTRAGR VARAVGRLLA HEWTLAVLGS LALAVAMTWP TLRYPRYTLP QDYWDPSLQA WQMAWTGHAL LTDPGRLLHS NAFFPERWSF VFSDTLLGYA PAGMIGSGPE AALLRYNIMF VLAHALATFG AYALARQLGA GRIGAAVAGV TYTYAPWLLA QAGHLHVVSN GGIPLALAML ARGHGWSLRH GYRPEKRRVR WVYAGWLVAA WQLSLGFGIG LPFAYVLAGI GLVATITWVL RRWVVRPVRR PFGRRLFAAD AVGGLIFAAV GGLLALPYFT VAKLHPQAER TLDDLAWYSP PATGFLTAPA ESRVWGDLHE GARSTLPWHP EMTLLPGYVL YALALGGLFY SVWRVRHRLL LLAGVVVSMV LAMGTEFFDG RFTLVPLFEH VPGWNGLRTP GRLMLWATLL LGLLAAGSVS AFASRVRKIS AARIPSWPNP WLRLATLLPL LLVTVEGLNR TPHPVVPVQP AAMRTVDGPL LVLPSDQSHD QPVMLWSTTH FQDIVNGGSG FTPTLLEDVR EVTTAFPDQA SVEYLRTLGV RNVLIPRDLV AGTPWEISID APVDGLGITR EEIGNVVVYR L
|
| |