Gene Saro_1856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1856 
Symbol 
ID3917077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1955826 
End bp1957391 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content64% 
IMG OID640444600 
Producttype II secretion system protein E 
Protein accessionYP_497130 
Protein GI87199873 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.808966 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCTT TCGGGCGAAA GAACGGCATT GCAGGCAATT CCGGCGGAGG TCGCCCCTCG 
TTCGGCGTGG CCCGCCCGAT GCGTGGAGGC GAAGCCGGGG CCACGGGCGC CGCGCCGTCG
TCGGCTTCCA CGCCGATGCC CGTCCCGATG CCCATGCCGA TGGGAGGGGA CCAGTTCCCG
CCGATCCCGT CGATGGAGGA CATGGACGCC CCGTCGAGCG CGGCTGCCCT GAAGGCCGAT
GCGCTGTCGC GCCTGGCGGA CCGCGCCAAC GCGGTTGCCG AGGGCAATTA CCAGGCCGAA
GGGTTCGAAG CCTCGGTCCA CAAGATCAAG GAACAGGTGC TGCCGCGCCT GCTCGAGCGC
GTCGATCCGG AAGCGGCCGC GACCCTGACC AAGGACGAGC TGTCCGAGGA ATTCCGGCCG
ATCATCATGG AAGTGCTGGC CGAGCTGAAG GTCACGCTGA ACAGGCGCGA GCAGTTCGCG
CTGGAAAAGG TGCTGATCGA CGAGCTGCTC GGCTTCGGTC CGCTGGAAGA GCTGCTCAAC
GACCCGGACG TTTCCGACAT CATGGTCAAC GGACCTGAGC AGACCTACAT CGAAAAGAAG
GGCAAGCTGC AACTTGCCCC GATCCGTTTT CGCGACGAAA GCCACCTGTT CCAGATCGCC
CAGCGCATCG TCAACCAGGT CGGCCGCCGC GTCGACCAGA CCACGCCGCT GGCCGACGCC
CGCCTCAAGG ACGGCAGCCG CGTCAACGTG ATCGTGCCGC CGCTGTCGCT GCGCGGCACC
GCGATCTCGA TCCGCAAGTT CTCCGAAAAG CCGATCACGA TCGACATGCT GCGCGACTTC
GGCTCGATGT CGGACAAGAT GGCGACCTGC CTCAAGATCG CCGGCGCAAG CCGCATGAAC
GTGGTCATCT CGGGCGGTAC CGGTTCGGGC AAGACGACGA TGCTCAACGC CCTGTCGAAG
ATGATCGACC CGGGCGAGCG CGTGTTGACC ATCGAGGACG CGGCCGAACT CCGCTTGCAG
CAGCCGCACT GGCTTCCGCT GGAAACGCGT CCGCCGAACC TTGAAGGCCA GGGTGCGATC
ACCATCGGCG ACCTTGTGAA GAACGCGCTG CGCATGCGTC CCGACCGCAT CATCCTGGGC
GAAATCCGTG GCGCAGAATG CTTCGACCTT CTGGCGGCGA TGAACACCGG CCACGACGGG
TCGATGTGCA CGCTTCACGC CAACAGCCCG CGCGAATGCC TTGGCCGTAT GGAAAACATG
ATCCTGATGG GCGATATCAA GATCCCCAAG GAGGCCATCA GCCGCCAGAT CGCGGAATCG
GTCGACCTGA TCGTTCAGGT CAAGCGCCTG CGCGATGGTT CGCGTCGCAC GACCAACATC
ACCGAAGTGA TCGGGATGGA AGGCGACGTG ATCGTGACGC AGGAACTGTT CAAGTTCGAA
TATCTGGACG AGACCGACGA AGGCAAGATC ATCGGCGAGT TCCGGTCCTC CGGCCTGCGC
CCATACACTC TTGAAAAGGC GCGCCAGTTC GGGTTCGACC AGGCCTATCT CGAGGCCTGC
CTCTAG
 
Protein sequence
MSAFGRKNGI AGNSGGGRPS FGVARPMRGG EAGATGAAPS SASTPMPVPM PMPMGGDQFP 
PIPSMEDMDA PSSAAALKAD ALSRLADRAN AVAEGNYQAE GFEASVHKIK EQVLPRLLER
VDPEAAATLT KDELSEEFRP IIMEVLAELK VTLNRREQFA LEKVLIDELL GFGPLEELLN
DPDVSDIMVN GPEQTYIEKK GKLQLAPIRF RDESHLFQIA QRIVNQVGRR VDQTTPLADA
RLKDGSRVNV IVPPLSLRGT AISIRKFSEK PITIDMLRDF GSMSDKMATC LKIAGASRMN
VVISGGTGSG KTTMLNALSK MIDPGERVLT IEDAAELRLQ QPHWLPLETR PPNLEGQGAI
TIGDLVKNAL RMRPDRIILG EIRGAECFDL LAAMNTGHDG SMCTLHANSP RECLGRMENM
ILMGDIKIPK EAISRQIAES VDLIVQVKRL RDGSRRTTNI TEVIGMEGDV IVTQELFKFE
YLDETDEGKI IGEFRSSGLR PYTLEKARQF GFDQAYLEAC L