Gene Information |
Locus tag | Saro_1079 |
Symbol | |
ID | 3916375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1122781 |
End bp | 1124400 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443814 |
Product | type II and III secretion system protein |
Protein accession | YP_496358 |
Protein GI | 87199101 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4964] Flp pilus assembly protein, secretin CpaC |
TIGRFAM ID | |
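The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon (an assumption that agrees with the stated gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the Saro_1079 record.
length = gene_length_bp(1122781, 1124400)
print(length)                     # 1620
print(protein_length_aa(length))  # 539
```

Both results match the Gene Length (1620 bp) and Protein Length (539 aa) fields of the record.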
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.103165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAGCCCG CGCGACAGAA CCAGATTGCG AAGGTCCGCA CGATGAACCG CCGAATCTCC ACTTCGCTCC TCACGGCCGC CGCGCTCGCG CTGCCGCTGG CCCTGGTCCC GGCAGGCGCC CCGCTCCATG CCCAGGGCAT GACCAGCCCG GCGCGCACGG TGACCATCTC CATCGGTCGC GGCGAACTGG TCACGGTTCC GGGCCGCATG GCCGACGTCT TCGTGGCCGA CGAGAGGGTC GCCGACGTCC AGGTCAAGTC GACCAACCAG CTCTATGTCT TCGGCAAGGC CGGCGGCGAG ACGACGATCT ATGCAAGCAA CGCCAAGGGC GATGTCGTGT GGTCGGCCAA TGTGCGGGTG GGCAACAACC TCGACTCGAT CGACCAGATG CTGCACCTCG CCATGCCCGA AGCGCGCATC AACGTGGCGA CGATGAACAA CACGGTCCTG CTCACCGGCA CCGTGGCAGC GCCCGAGGAT GCCGCCGAGG CCGAACGCCT GGTCAAGGCT TTCGTCGGCG AGAAGACCAA CGTCATCAGC CGCCTCAAGA TGGCAACTCC GCTCCAGGTC AGCCTGCACG TGAAGTTCGC CGAGGTGAGC CGTTCGCTGG TGCGCACGCT GGGCGTCAAC CTCACCACGA TCGACGGCAC CGGAGGCATC AAGTTCGGCA TCGGCCAGGG CAGCACGACC GGCTCCGTGG CGACCAACCG CAACCTGCCG TTTGGCGTCG GCACCAGCTC GACCTCGGGC TACATCCTCG ACCCGACCGG TGCCACGTCC AACTACGTGT CCGCGACTGG AACGTCGGTC GCTGCTAGCA GCAGCGGCAC GACCATTGCG GGCATGGGCA AGCTGCTGGG CCTCGACCTT CTCGCCGCGC TCGATGCCGG TGAGACAATC GGTCTGGTGA CCACGCTGTC GGAACCGAAC CTGACTGCCA TTTCCGGCGA AACTGCCGAA TTCCTGGCGG GTGGCGAATA CCCGATACCG GTTTCGCAGG GTCTCGGCAC GACGTCGATC GAGTACAAGA AGTACGGCGT GAGCCTGGCC TACACGCCGA CCGTGCTGGC CAACGGCCGC ATCTCGATCC GCGTGCGTCC CGAAGCCTCG GAGCTATCCA GCACCGGGGC ACTCAAGCTC GACAGCGTCG AGATTCCCGC GCTGACCGTT CGCCGCGCCG AAACCACGGT GGAACTCGGT TCGGGACAAT CGTTCATGAT CGCCGGCCTG CTTCAGAACG GTGCGCAGAA CGCACTGACC AAGATGCCTG GCGCGGGCGA CATCCCGATC CTTGGCTCGC TCTTCCGCTC GACCAGCTAC AAGAAGGGCG AGACCGAACT GGTGATCGTG GTCACCCCCT ACCTGGTGAA TCCGGTCAAC GCGAACGACA TCAAGCTGCC GACCGACGGC TTCCAGAGCC CGAACGAAAT CCAGCGCCTG CTCGGCCACA TGGAAAGCGA CGGCGTAACC GGCGGCGACC GGCCCAAGCC GACGCAGAAG GAAGGCACGA CTCAGCAGGG CCCCAAGGTG GGCGAACTCG ACGTGCCCAC CACGCCGGCC GACCGCAAGA AGGTCGCCGC CGCGCCTGCC GCCGCCGAAC CCGGCTTCAG CATCCAGTGA
Protein sequence | MKPARQNQIA KVRTMNRRIS TSLLTAAALA LPLALVPAGA PLHAQGMTSP ARTVTISIGR GELVTVPGRM ADVFVADERV ADVQVKSTNQ LYVFGKAGGE TTIYASNAKG DVVWSANVRV GNNLDSIDQM LHLAMPEARI NVATMNNTVL LTGTVAAPED AAEAERLVKA FVGEKTNVIS RLKMATPLQV SLHVKFAEVS RSLVRTLGVN LTTIDGTGGI KFGIGQGSTT GSVATNRNLP FGVGTSSTSG YILDPTGATS NYVSATGTSV AASSSGTTIA GMGKLLGLDL LAALDAGETI GLVTTLSEPN LTAISGETAE FLAGGEYPIP VSQGLGTTSI EYKKYGVSLA YTPTVLANGR ISIRVRPEAS ELSSTGALKL DSVEIPALTV RRAETTVELG SGQSFMIAGL LQNGAQNALT KMPGAGDIPI LGSLFRSTSY KKGETELVIV VTPYLVNPVN ANDIKLPTDG FQSPNEIQRL LGHMESDGVT GGDRPKPTQK EGTTQQGPKV GELDVPTTPA DRKKVAAAPA AAEPGFSIQ
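The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own tooling, of how such a string might be cleaned, how a GC fraction could be computed, and how the gene sequence could be translated. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with table 1. Table 11 also allows alternative start codons that are read as M at the first position; that case is not handled here, since this gene starts with ATG.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("ATGAAGCCCG CGCGACAGAA CCAGATTGCG")
print(translate(prefix))  # MKPARQNQIA
```

The translated prefix matches the first block of the listed protein sequence. Running `gc_fraction` over the full cleaned gene sequence gives a value that can be compared with the GC content field.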