Gene Information |
Locus tag | Saro_1856 |
Symbol | |
ID | 3917077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1955826 |
End bp | 1957391 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444600 |
Product | type II secretion system protein E |
Protein accession | YP_497130 |
Protein GI | 87199873 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
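The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Saro_1856.
length = gene_length_bp(1955826, 1957391)
print(length)                     # 1566
print(protein_length_aa(length))  # 521
```

Both results agree with the Gene Length (1566 bp) and Protein Length (521 aa) fields listed in the record.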
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.808966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCTT TCGGGCGAAA GAACGGCATT GCAGGCAATT CCGGCGGAGG TCGCCCCTCG TTCGGCGTGG CCCGCCCGAT GCGTGGAGGC GAAGCCGGGG CCACGGGCGC CGCGCCGTCG TCGGCTTCCA CGCCGATGCC CGTCCCGATG CCCATGCCGA TGGGAGGGGA CCAGTTCCCG CCGATCCCGT CGATGGAGGA CATGGACGCC CCGTCGAGCG CGGCTGCCCT GAAGGCCGAT GCGCTGTCGC GCCTGGCGGA CCGCGCCAAC GCGGTTGCCG AGGGCAATTA CCAGGCCGAA GGGTTCGAAG CCTCGGTCCA CAAGATCAAG GAACAGGTGC TGCCGCGCCT GCTCGAGCGC GTCGATCCGG AAGCGGCCGC GACCCTGACC AAGGACGAGC TGTCCGAGGA ATTCCGGCCG ATCATCATGG AAGTGCTGGC CGAGCTGAAG GTCACGCTGA ACAGGCGCGA GCAGTTCGCG CTGGAAAAGG TGCTGATCGA CGAGCTGCTC GGCTTCGGTC CGCTGGAAGA GCTGCTCAAC GACCCGGACG TTTCCGACAT CATGGTCAAC GGACCTGAGC AGACCTACAT CGAAAAGAAG GGCAAGCTGC AACTTGCCCC GATCCGTTTT CGCGACGAAA GCCACCTGTT CCAGATCGCC CAGCGCATCG TCAACCAGGT CGGCCGCCGC GTCGACCAGA CCACGCCGCT GGCCGACGCC CGCCTCAAGG ACGGCAGCCG CGTCAACGTG ATCGTGCCGC CGCTGTCGCT GCGCGGCACC GCGATCTCGA TCCGCAAGTT CTCCGAAAAG CCGATCACGA TCGACATGCT GCGCGACTTC GGCTCGATGT CGGACAAGAT GGCGACCTGC CTCAAGATCG CCGGCGCAAG CCGCATGAAC GTGGTCATCT CGGGCGGTAC CGGTTCGGGC AAGACGACGA TGCTCAACGC CCTGTCGAAG ATGATCGACC CGGGCGAGCG CGTGTTGACC ATCGAGGACG CGGCCGAACT CCGCTTGCAG CAGCCGCACT GGCTTCCGCT GGAAACGCGT CCGCCGAACC TTGAAGGCCA GGGTGCGATC ACCATCGGCG ACCTTGTGAA GAACGCGCTG CGCATGCGTC CCGACCGCAT CATCCTGGGC GAAATCCGTG GCGCAGAATG CTTCGACCTT CTGGCGGCGA TGAACACCGG CCACGACGGG TCGATGTGCA CGCTTCACGC CAACAGCCCG CGCGAATGCC TTGGCCGTAT GGAAAACATG ATCCTGATGG GCGATATCAA GATCCCCAAG GAGGCCATCA GCCGCCAGAT CGCGGAATCG GTCGACCTGA TCGTTCAGGT CAAGCGCCTG CGCGATGGTT CGCGTCGCAC GACCAACATC ACCGAAGTGA TCGGGATGGA AGGCGACGTG ATCGTGACGC AGGAACTGTT CAAGTTCGAA TATCTGGACG AGACCGACGA AGGCAAGATC ATCGGCGAGT TCCGGTCCTC CGGCCTGCGC CCATACACTC TTGAAAAGGC GCGCCAGTTC GGGTTCGACC AGGCCTATCT CGAGGCCTGC CTCTAG
|
Protein sequence | MSAFGRKNGI AGNSGGGRPS FGVARPMRGG EAGATGAAPS SASTPMPVPM PMPMGGDQFP PIPSMEDMDA PSSAAALKAD ALSRLADRAN AVAEGNYQAE GFEASVHKIK EQVLPRLLER VDPEAAATLT KDELSEEFRP IIMEVLAELK VTLNRREQFA LEKVLIDELL GFGPLEELLN DPDVSDIMVN GPEQTYIEKK GKLQLAPIRF RDESHLFQIA QRIVNQVGRR VDQTTPLADA RLKDGSRVNV IVPPLSLRGT AISIRKFSEK PITIDMLRDF GSMSDKMATC LKIAGASRMN VVISGGTGSG KTTMLNALSK MIDPGERVLT IEDAAELRLQ QPHWLPLETR PPNLEGQGAI TIGDLVKNAL RMRPDRIILG EIRGAECFDL LAAMNTGHDG SMCTLHANSP RECLGRMENM ILMGDIKIPK EAISRQIAES VDLIVQVKRL RDGSRRTTNI TEVIGMEGDV IVTQELFKFE YLDETDEGKI IGEFRSSGLR PYTLEKARQF GFDQAYLEAC L
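The sequence fields are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how the displayed gene sequence could be cleaned, checked for GC content, and translated for comparison with the protein sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the strand is "-"), and it uses the amino acid assignments of NCBI translation table 11, which are the same as the standard code. All function names are hypothetical, and alternative start codons permitted by table 11 are not handled because this gene starts with ATG.

```python
import re

# Codon table in the conventional TCAG order. Translation table 11 uses the
# same amino acid assignments as the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return re.sub(r"\s+", "", text).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 15 bases of the gene sequence as displayed in the record.
print(translate("ATGAGCGCTT TCGGG"))  # MSAFG
```

Applied to the full Gene sequence field, the output of translate can be compared with the Protein sequence field after clean_sequence has been applied to both, and gc_content can be compared with the reported GC content of 64%.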