Gene Saro_1521 details

Gene Information

Locus tag: Saro_1521
Symbol:
ID: 3917196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1565208
End bp: 1566404
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 59%
IMG OID: 640444262
Product: Flp pilus assembly protein ATPase CpaE-like
Protein accession: YP_496796
Protein GI: 87199539
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4963] Flp pilus assembly protein, ATPase CpaE
TIGRFAM ID:
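
The length fields can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon. The function names are illustrative and not part of the source database.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 1565208
END_BP = 1566404


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1197 398"
```

Both values agree with the Gene Length and Protein Length fields of the record.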


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGCC CTGGCGCTTC GGCCACGACC TCGCCCTTTG GCGGAAATGG TCCAGTTGTC 
GCAGTGTGTG CTTCGGCGAA GCAACTCGCC TTGCTCGAAC GGCAAGTAGA TGCTCTTGCT
CGAATCCTGG TTGTTCCTTT CCCGCTCGGC TCCGCGGACG GGGTTGATCA GACTTCTGTC
GCAAAGGCGT CGGTTTTTGT CCTCGAGGTC GATCCGGCCG ATAGGGCTTC GGTCGATCGC
CTGATCCGGA TGCGCAAGGC GATGCCCGCA ACTGCGCTCA TTGCCGCCGT CGAAAACCCT
GATCTTACCC TGACGCGCAC TCTCGTGCGC GAAGGGGTCA CTGACGTGAT TGCCTTGCCG
TTTCGAGCCG ATGAACTTGT CAGCGCAACG CTCGATGCCA TGGCGCGGCA TGCGCAGGCG
ATCGTTCCGG TGACGCTCGC GCCCGTGATT GGCGTGGTAC GCAGTTGCGG AGGATGCGGG
GCTACGACGG TCGCCACGCA CCTTGCCCAT GCCCTCAATC AATTTAGCTG GACGAATGGT
CCGGCTATCG TTGCGGACCT CGATCTCCAG TTCGGCGAAG TCGGCGCATA TCTTGACAGC
AGTCGCAGTG GGTCGATCAC CGATCTCATG CTCGCGCATG ACAGGATTGA CCGCGAGTTC
CTCTATTCCA TGGCGCCGCC TACCTCTGGC GGTGTCGGGG TGCTGTCTGC GCCAGCTACC
ATCAACTCAA TCGAGTCCGT GAACGTCGAT GACATGCTCT TCGTCCTGGA CCAGCTGCGC
CGGAATTACG GAGTAGTTGT CCTCGACTTT CCATCCGCAT GGAGCAACTG GGCTGCATCG
TTGGCCGTTC TTTCCGACAT TCTATTGCTG GTGACGCCCG TTGCACTTTC CGGACTGCGG
CAGACGAAAC GAACACTCGA CCTGTTTCGG ACATTGGAAA TTCCGGACGA AAAGGTGGCA
ATCGTGGCCA ATCGCGTCGA GCGGAAACTT TTCCGCCTTG TTGGTACGAG CGAGGCTGAG
GCGGCGATCG GCCGCAGTTT CGCCGCCTCG CTCTCGGATG AAGGCGATCA GATGGTTCGT
GCGCAGGAGC AGGGCGTACT GATCCACAGC ATCCAGAAGA AGACCTCTTT CAGCACAGCG
CTGATGAAGC TTGCGCAGTC GATACACGTT CAGTTGCATT CCGGACAGTT GCTATGA
 
Protein sequence
MSGPGASATT SPFGGNGPVV AVCASAKQLA LLERQVDALA RILVVPFPLG SADGVDQTSV 
AKASVFVLEV DPADRASVDR LIRMRKAMPA TALIAAVENP DLTLTRTLVR EGVTDVIALP
FRADELVSAT LDAMARHAQA IVPVTLAPVI GVVRSCGGCG ATTVATHLAH ALNQFSWTNG
PAIVADLDLQ FGEVGAYLDS SRSGSITDLM LAHDRIDREF LYSMAPPTSG GVGVLSAPAT
INSIESVNVD DMLFVLDQLR RNYGVVVLDF PSAWSNWAAS LAVLSDILLL VTPVALSGLR
QTKRTLDLFR TLEIPDEKVA IVANRVERKL FRLVGTSEAE AAIGRSFAAS LSDEGDQMVR
AQEQGVLIHS IQKKTSFSTA LMKLAQSIHV QLHSGQLL
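
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The sketch below is an illustration only, assuming the amino acid assignments of NCBI translation table 11 (which match the standard code for internal codons) and a gene that starts with ATG, as this one does. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and do not come from the source database.

```python
from itertools import product

# Codon table in the usual NCBI ordering (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the listing.
first_line = "ATGAGCGGCC CTGGCGCTTC GGCCACGACC TCGCCCTTTG GCGGAAATGG TCCAGTTGTC"
print(translate(first_line))  # prints "MSGPGASATTSPFGGNGPVV"
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein listing and the reported GC content.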