Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1522 |
Symbol | |
ID | 3917197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1566401 |
End bp | 1567804 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640444263 |
Product | type II secretion system protein E |
Protein accession | YP_496797 |
Protein GI | 87199540 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTGGG AAATCCGCCG CAGGAACGTA AAGCCAAGAT CGCCCATCGC GGTACCGGCC CTGCCTTTGC CCGGCGATAC GATGTTGACC CCAGACAGGA CGGACGAGGC TCAGGTCGGT GCCGCTCCCG CCGCAAATGA CGTTCTGCTT GGCATCAAGG TCGACATCCA TCGAGAGCTT CTCGACCGCG TCAATCTGGC GGCCATCGAA AAGCTTTCGC GAACCGACCT GGTTCGTGAA CTCTCTGACA TCATCGGCGG CATCCTGACC GAACGGAATA TCGCGCTCAA TCGTGTCGAG CGCGAAGATC TCGTTGAAGA CATCGTCGAT GAACTGGTCG GCCTCGGTCC GCTTGAGCCA CTCATCAAGG ATGACAGCAT ATCGGACATT CTCGTCAACG GTTACGAGAC AGTTTTCGTC GAACGCGGCG GCAAACTGCA GCGAGTATCG ACGCGGTTCC AGGATGAGCG GCACCTCCTG CGCATAATCC AGAAGATTGT CAGTGCCGTA GGTCGCCGCG TCGACGAATC CTCGCCATTT GTCGATGCGA GACTGGCGGA CGGTTCCCGC GTAAATGCGA TCGTTGCGCC GCTTGCTATC GACGGATCAC TGTTGTCGAT CCGCAAGTTC TCCAAGAAGC CGATCAGCAT GGCCCGAATG ATCGAGATTG GCAGCTTGTC AGAACCAATG GCGATTCTGC TCAAGGCCGT GGTTGAAGGT CGTCTCAACA TCATCATCTC TGGCGGCACC GGCTCGGGCA AGACGACGAT GCTCAATGCC TTGTCTTCGT ACATCGATGG CACCGAACGT ATCGTCACGA TCGAGGACTC GGCCGAACTT CAACTCCAGC AGGAGCACGT TGCGCGTTTG GAGACGCGCC CCCCCAACAT CGAGGGGCGC GGTGAGGTCA GCCAACGCGA TCTGGTCAAG AATGCCCTGC GCATGCGGCC TGACCGGATC ATCCTGGGGG AATGCCGTGC GGGCGAAGCC TTCGATATGC TTCAGGCGAT GAACACGGGG CATGACGGCT CGATGACGAC GGTACATGCA AACACTCCGC GCGATGCGCT GACGCGTATT GAACAGATGG TTGGCATGAG CGGCATCGAT ATTGCGCCTC GTTCGGTCCG GGCCCAGATC GGCTCGGCCG TCAACGTCGT GATCCAGATC GGCCGTCTTT CCGACGGTCG ACGCAAGACT CTCAGCATTT CCGAATTGAC CGGGATGGAG GGGGAAACGA TCACCATGCA GGAGATTTTC CGCTTCAACC AGCGTGGGCG CGACGAGCTC GGCAACGTCA TTGGCCATTT CGAAGCGACC GGCATCCGCC CCCGGTTCGC TGCACGCCTC GAGGCGAGTG GCATCCACCT CGCCGCCGAT CTATTCAAGC CGACGATGGG GTGA
|
Protein sequence | MNWEIRRRNV KPRSPIAVPA LPLPGDTMLT PDRTDEAQVG AAPAANDVLL GIKVDIHREL LDRVNLAAIE KLSRTDLVRE LSDIIGGILT ERNIALNRVE REDLVEDIVD ELVGLGPLEP LIKDDSISDI LVNGYETVFV ERGGKLQRVS TRFQDERHLL RIIQKIVSAV GRRVDESSPF VDARLADGSR VNAIVAPLAI DGSLLSIRKF SKKPISMARM IEIGSLSEPM AILLKAVVEG RLNIIISGGT GSGKTTMLNA LSSYIDGTER IVTIEDSAEL QLQQEHVARL ETRPPNIEGR GEVSQRDLVK NALRMRPDRI ILGECRAGEA FDMLQAMNTG HDGSMTTVHA NTPRDALTRI EQMVGMSGID IAPRSVRAQI GSAVNVVIQI GRLSDGRRKT LSISELTGME GETITMQEIF RFNQRGRDEL GNVIGHFEAT GIRPRFAARL EASGIHLAAD LFKPTMG
|
| |