Gene Information |
Locus tag | Noca_0721 |
Symbol | |
ID | 4599825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 766438 |
End bp | 768258 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639775322 |
Product | general secretion pathway protein E |
Protein accession | YP_921933 |
Protein GI | 119714968 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
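The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that happens to match the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 766438
end_bp = 768258
gene_length_bp = 1821
protein_length_aa = 606


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

The strand field does not affect this check, because the span length is the same on either strand.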
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.831257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATGG CCCAGATGCT CACCCGCAGC GGGCTGATCA CCGACGACCA GCTGCAGCGC GCGATGCTGG AGTACTCCCG GACCGGCGAC CCGCTCGGCG ACATCCTGGT GTCCCACGAG GCGATCACGG AGGACGTCCT CGTGGCCGCG CTGTCGGAGA TGTACCAGAT GCAGCGGGTC GGGCTGGCCG GCTTCACCCC CGACTTCGAG GTGGCCCGGC GGCTCCCGGA GCGGCTCGCC CACACCTTCC AGGCGGTGCC GGTCGCGGCG ACCGACGACC TGGTCCTCCT GGCCGTCGCC CGCCCCCTCG ACACCGAGCA GGCGGCCGAG GTCGAGGAGG CGCTCGGCTC GCCGTTCCGC CAGCTGCTGG CCAACCGCAC CGAGCTCGAC CAGCTGGTCC AGCGGGTCCA CGCCCGGCAC TACGCCGAGG TGTCCACCCG GCTGCTGATG GAGACCCGGC CGGAGGAGTC TGCCCACATC GTCATCTCGG GCGGCCAGAA GGCCGCGCTG GTGACCACCG CGATCGTGGT CGTGATGTGC GCGGTGATCT GGCCGATGGA GACGGCGATC GCCGTGGTCG CGGCGTGCAG CCTGCTCTAC CTCGTGGTGT CGTTCTACAA GTTCCGGCTC ACCCTGCGGG CGCTCGGGAC CCACCTGGAG ACCGACGTCA CCGACGAGGA GATCGCGGCG ATCGACGAGC GGCACCTGCC GACGTACACG ATCCTGGTCC CGCTCTACAA GGAGGCGGGC ATCGTCCCGC GTCTGGTCCG CGACATCAAC GCGCTCGACT ACCCCCGCAC CCGGCTCGAC GTGAAGCTGC TGTGCGAGGA GGACGACGAG GAGACGGTCC AGAGGATCCG GGACCTGCAG CTGCCGCCGC ACTTCCACCT CGTGGTGGTC CCGGACAGCC AGCCCAAGAC CAAGCCCAAG GCGTGCAACT ACGGGCTCCA GCTGGCTACG GGTGACTACT GCGTCATCTT CGACGCCGAG GACCGGCCGG ACCCCGACCA GCTGAAGAAG GCCATCATCG CCTTCAGCCG GGTGCCCGAG AACGTCGTGT GCGTCCAGGC GAAGCTCAAC CACTTCAACC AGGACCAGAA CATGCTCACG GCCTGGTTCG CCAACGAGTA CTCGATGCAC TTCGAGCTGG TGCTCCCGGC GATGGGCGCG GCCGAGTCCC CGATCCCGCT CGGGGGGACC TCGAACCACT TCGTCACCGC CAAGCTGCGC GAGCTGGGCG CCTGGGACCC GTTCAACGTG ACCGAGGACG CCGACCTCGG CATCCGACTG CACCGCGAGG GCTACCGCAC GGCGATGATC GACTCCACCA CCCTGGAGGA GGCCAACTCC CAGGTCCCGA ACTGGATCCG CCAGCGCAGC CGCTGGAACA AGGGCTACAT CCAGACCTGG CTGGTCCACA TGCGCGCCCC CTTCGCGCTG CTGTCCCAGA CGGGCCTCAA GGGGTTCTTG AGCTTCAACC TCACGATGGG CAGCGCGTTC GTGCTGCTCC TCAACCCGAT CTTCTGGGCG CTCACGACGC TGTACGTCTT CACCCAGGCG GGCTTCATCG AGCAGCTGTT CCCCGGCATC ATCTTCTACG CCGCGAGCGC GCTGCTGTTC GTCGGCAACT TCGTGTTCGT CTACCTCAAC GTCGCGGGCA GCCTGCACCG CGGCGAGTTC GGGCTGACCC GGACGGCGCT GCTCTCCCCC CTCTACTGGG GCCTGATGAG CTGGGCCGCG TGGAAGGGCT TCATCCAGCT CTTCACGAAC CCGTTCTACT GGGAGAAGAC GGTGCACGGC 
CTGGACGAGG GCCACGGGTG A
|
Protein sequence | MQMAQMLTRS GLITDDQLQR AMLEYSRTGD PLGDILVSHE AITEDVLVAA LSEMYQMQRV GLAGFTPDFE VARRLPERLA HTFQAVPVAA TDDLVLLAVA RPLDTEQAAE VEEALGSPFR QLLANRTELD QLVQRVHARH YAEVSTRLLM ETRPEESAHI VISGGQKAAL VTTAIVVVMC AVIWPMETAI AVVAACSLLY LVVSFYKFRL TLRALGTHLE TDVTDEEIAA IDERHLPTYT ILVPLYKEAG IVPRLVRDIN ALDYPRTRLD VKLLCEEDDE ETVQRIRDLQ LPPHFHLVVV PDSQPKTKPK ACNYGLQLAT GDYCVIFDAE DRPDPDQLKK AIIAFSRVPE NVVCVQAKLN HFNQDQNMLT AWFANEYSMH FELVLPAMGA AESPIPLGGT SNHFVTAKLR ELGAWDPFNV TEDADLGIRL HREGYRTAMI DSTTLEEANS QVPNWIRQRS RWNKGYIQTW LVHMRAPFAL LSQTGLKGFL SFNLTMGSAF VLLLNPIFWA LTTLYVFTQA GFIEQLFPGI IFYAASALLF VGNFVFVYLN VAGSLHRGEF GLTRTALLSP LYWGLMSWAA WKGFIQLFTN PFYWEKTVHG LDEGHG
|
| |
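As a rough check that the gene and protein sequences agree, the first and last codons of the gene sequence can be translated and compared with the two ends of the protein sequence. This is a minimal sketch: it hard-codes the amino acid assignments shared by the standard code and translation table 11, ignores alternative start codons, and uses hypothetical helper names (`clean`, `translate`, `gc_fraction`). Only the two short fragments copied from the record are used, not the full 1821 bp sequence.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the 10-base spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 21 bases of the gene sequence, copied from the record.
first_bases = "ATGCAGATGG CCCAGATGCT CACCCGCAGC"
last_bases = "CTGGACGAGG GCCACGGGTG A"

print(translate(first_bases))  # MQMAQMLTRS, the start of the protein sequence
print(translate(last_bases))   # LDEGHG, the end of the protein sequence
```

Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field. The sequence is listed in coding orientation (it begins with ATG) even though the gene lies on the minus strand, so no reverse complement is needed before translating.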