Gene Saro_1079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1079 
Symbol 
ID3916375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1122781 
End bp1124400 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content66% 
IMG OID640443814 
Producttype II and III secretion system protein 
Protein accessionYP_496358 
Protein GI87199101 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4964] Flp pilus assembly protein, secretin CpaC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.103165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCCG CGCGACAGAA CCAGATTGCG AAGGTCCGCA CGATGAACCG CCGAATCTCC 
ACTTCGCTCC TCACGGCCGC CGCGCTCGCG CTGCCGCTGG CCCTGGTCCC GGCAGGCGCC
CCGCTCCATG CCCAGGGCAT GACCAGCCCG GCGCGCACGG TGACCATCTC CATCGGTCGC
GGCGAACTGG TCACGGTTCC GGGCCGCATG GCCGACGTCT TCGTGGCCGA CGAGAGGGTC
GCCGACGTCC AGGTCAAGTC GACCAACCAG CTCTATGTCT TCGGCAAGGC CGGCGGCGAG
ACGACGATCT ATGCAAGCAA CGCCAAGGGC GATGTCGTGT GGTCGGCCAA TGTGCGGGTG
GGCAACAACC TCGACTCGAT CGACCAGATG CTGCACCTCG CCATGCCCGA AGCGCGCATC
AACGTGGCGA CGATGAACAA CACGGTCCTG CTCACCGGCA CCGTGGCAGC GCCCGAGGAT
GCCGCCGAGG CCGAACGCCT GGTCAAGGCT TTCGTCGGCG AGAAGACCAA CGTCATCAGC
CGCCTCAAGA TGGCAACTCC GCTCCAGGTC AGCCTGCACG TGAAGTTCGC CGAGGTGAGC
CGTTCGCTGG TGCGCACGCT GGGCGTCAAC CTCACCACGA TCGACGGCAC CGGAGGCATC
AAGTTCGGCA TCGGCCAGGG CAGCACGACC GGCTCCGTGG CGACCAACCG CAACCTGCCG
TTTGGCGTCG GCACCAGCTC GACCTCGGGC TACATCCTCG ACCCGACCGG TGCCACGTCC
AACTACGTGT CCGCGACTGG AACGTCGGTC GCTGCTAGCA GCAGCGGCAC GACCATTGCG
GGCATGGGCA AGCTGCTGGG CCTCGACCTT CTCGCCGCGC TCGATGCCGG TGAGACAATC
GGTCTGGTGA CCACGCTGTC GGAACCGAAC CTGACTGCCA TTTCCGGCGA AACTGCCGAA
TTCCTGGCGG GTGGCGAATA CCCGATACCG GTTTCGCAGG GTCTCGGCAC GACGTCGATC
GAGTACAAGA AGTACGGCGT GAGCCTGGCC TACACGCCGA CCGTGCTGGC CAACGGCCGC
ATCTCGATCC GCGTGCGTCC CGAAGCCTCG GAGCTATCCA GCACCGGGGC ACTCAAGCTC
GACAGCGTCG AGATTCCCGC GCTGACCGTT CGCCGCGCCG AAACCACGGT GGAACTCGGT
TCGGGACAAT CGTTCATGAT CGCCGGCCTG CTTCAGAACG GTGCGCAGAA CGCACTGACC
AAGATGCCTG GCGCGGGCGA CATCCCGATC CTTGGCTCGC TCTTCCGCTC GACCAGCTAC
AAGAAGGGCG AGACCGAACT GGTGATCGTG GTCACCCCCT ACCTGGTGAA TCCGGTCAAC
GCGAACGACA TCAAGCTGCC GACCGACGGC TTCCAGAGCC CGAACGAAAT CCAGCGCCTG
CTCGGCCACA TGGAAAGCGA CGGCGTAACC GGCGGCGACC GGCCCAAGCC GACGCAGAAG
GAAGGCACGA CTCAGCAGGG CCCCAAGGTG GGCGAACTCG ACGTGCCCAC CACGCCGGCC
GACCGCAAGA AGGTCGCCGC CGCGCCTGCC GCCGCCGAAC CCGGCTTCAG CATCCAGTGA
 
Protein sequence
MKPARQNQIA KVRTMNRRIS TSLLTAAALA LPLALVPAGA PLHAQGMTSP ARTVTISIGR 
GELVTVPGRM ADVFVADERV ADVQVKSTNQ LYVFGKAGGE TTIYASNAKG DVVWSANVRV
GNNLDSIDQM LHLAMPEARI NVATMNNTVL LTGTVAAPED AAEAERLVKA FVGEKTNVIS
RLKMATPLQV SLHVKFAEVS RSLVRTLGVN LTTIDGTGGI KFGIGQGSTT GSVATNRNLP
FGVGTSSTSG YILDPTGATS NYVSATGTSV AASSSGTTIA GMGKLLGLDL LAALDAGETI
GLVTTLSEPN LTAISGETAE FLAGGEYPIP VSQGLGTTSI EYKKYGVSLA YTPTVLANGR
ISIRVRPEAS ELSSTGALKL DSVEIPALTV RRAETTVELG SGQSFMIAGL LQNGAQNALT
KMPGAGDIPI LGSLFRSTSY KKGETELVIV VTPYLVNPVN ANDIKLPTDG FQSPNEIQRL
LGHMESDGVT GGDRPKPTQK EGTTQQGPKV GELDVPTTPA DRKKVAAAPA AAEPGFSIQ