Gene Information |
Locus tag | Saro_2652 |
Symbol | |
ID | 3918426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2887977 |
End bp | 2889494 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640445429 |
Product | type II secretion system protein E |
Protein accession | YP_497922 |
Protein GI | 87200665 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E |
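The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal sketch in Python. The function names are illustrative and not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


# Values copied from the Gene Information table.
length_bp = gene_length(2887977, 2889494)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

Under these assumptions the computed values agree with the Gene Length (1518 bp) and Protein Length (505 aa) fields.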
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGGCGG GCGAAGACAT CGAGGGCATG ACCGTTGCGG TCGCGCAGGC CGGGCCTGCG ATGCCCTATG CCTTCGCGCG CGACCGCGGC GTCCTCGTGG AAAGCCTGAA CGACACCGAG GTTACGGTGG CCCTGCGCGA GGGCGGCGAT CCGCTCGCGT TGCTCGAAAT CAGGCGCGTA CACAAACGAC ACCTAAACGT ACACGAAGTG ACCACAACCG AGTTCGAGAA GCTCCTCGCC ACGGCCTATG CGGTCGACGG GGCGGCGGCT GCGGTGGCCG GCGACATGGG CATCGCGGGC GACGCGCTCG ACCCTCTCGC GCTCGGCCTG CCGACCGCAG AGGACCTCCT CGACAGCGCC GATGACGCGC CCGCGATCCG CCTGATCAAT GGCCTGATCG CCGAGAGCTT GCGCCAGGGC GTGTCGGACA TCCACATCGA ACCTTACGAA ACCGCGCTTG TCGTGAGGAT GCGGGTCGAT GGCGTGCTGA CCGAGAAGCT GCGCATGCCG CCGCATGTCG CGCCGGTGCT GGTGAGCCGC ATCAAGGTCA TGGCGCGGCT CGACATTGCC GAACGGCGCG TGCCGCAGGA CGGCCGTATC TCGTTGAGCC TGGGCGGAAA ACTGGTCGAC GTGCGCGTCT CGACCCTGCC CAACCGCGCC GGCGAGCGAG TGGTCATGCG CCTGCTCGAC AAGGAAAATG CCGGACTGGA CCTCGTCCAT CTCGGCCTCG ACCCCAAGTC GGAAGACGTC CTGAGCCGGG CCCTGGCGGA GCCAAACGGC ATCGTGCTGG TGACCGGCCC GACGGGTTCG GGCAAGACGA CGACGCTCTA TGCGGCGTTG CGCGGCCTCA ACGATGGCGC GCGGAATATC CTTACTGTTG AAGACCCTGT GGAATACGCT GTGGATGGCG TCGGCCAGAC GCAGGTCAAC GCCAAAGTCG GGCTGACCTT CGCGGCGGGA TTGCGCGCGA TCCTCCGCCA GGACCCGGAC GTGGTCATGG TCGGCGAAAT CCGCGACCGG GAAACCGCTG AAATCGCGGT GCAGGCGTCG CTGACCGGCC ACCTCGTGCT GTCGACCGTG CATACCAACG ACGCGGCCGG CGCGGTCACG CGCATGCGCG ACATGGGGGT GGAGCCGTTC CTGCTGGCAT CGACCCTGCG CGCGGTGATC GCACAGCGGC TGGTGCGCCG GCTCTGCCCG CATTGCCGCG AGGAGCGTGT CCTCGACGCG GGAATGGCCG AAGTCCTCGG TCTTGAGGCG GGCAGCAAGG TCCGTGCCGC ACGCGGCTGC GCAGAGTGCG GCCAGACCGG ATACCAGGGC CGTATCGGCG TGTTCGAGGC GCTGCGGGTG GACGATGCGA TCCGCCAGAT GATCCACGAC AATGCCGACG AGGCGGCCAT CGCGCGCCAT GCCTTCGCCG ATGCGCCGAC CCTTGCGGGA TCCGTGCGCC GCCTTGTCGC CGAAGGAGCG ACGAGCCCCG AGGAAGCGGC ACGCATCATG CGGCGGGACG GGGCCTGA
|
Protein sequence | MEAGEDIEGM TVAVAQAGPA MPYAFARDRG VLVESLNDTE VTVALREGGD PLALLEIRRV HKRHLNVHEV TTTEFEKLLA TAYAVDGAAA AVAGDMGIAG DALDPLALGL PTAEDLLDSA DDAPAIRLIN GLIAESLRQG VSDIHIEPYE TALVVRMRVD GVLTEKLRMP PHVAPVLVSR IKVMARLDIA ERRVPQDGRI SLSLGGKLVD VRVSTLPNRA GERVVMRLLD KENAGLDLVH LGLDPKSEDV LSRALAEPNG IVLVTGPTGS GKTTTLYAAL RGLNDGARNI LTVEDPVEYA VDGVGQTQVN AKVGLTFAAG LRAILRQDPD VVMVGEIRDR ETAEIAVQAS LTGHLVLSTV HTNDAAGAVT RMRDMGVEPF LLASTLRAVI AQRLVRRLCP HCREERVLDA GMAEVLGLEA GSKVRAARGC AECGQTGYQG RIGVFEALRV DDAIRQMIHD NADEAAIARH AFADAPTLAG SVRRLVAEGA TSPEEAARIM RRDGA
|
| |
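The Gene sequence and Protein sequence fields are related through translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal sketch in Python of how the two could be compared and how a GC fraction could be computed. The helper names are illustrative, the codon table is built by hand rather than taken from any library, and the sketch assumes the sequence is given in the coding orientation, 5' to 3', starting at the initiation codon.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
# Table 11 assigns the same amino acids as the standard code; it differs in
# which codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate_table11(seq: str) -> str:
    """Translate a CDS, reading an initiator as M and stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First four codons of the Gene sequence field, followed by a stop codon.
print(translate_table11("GTGGAGGCGG GCTGA"))
```

Passing the full Gene sequence field to `translate_table11` gives a string that can be compared with the Protein sequence field once its spaces are removed. The GTG at the start of this gene is read as M because it serves as the initiation codon.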