Gene Saro_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2046 
Symbol 
ID3917693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2184172 
End bp2185611 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content68% 
IMG OID640444798 
ProductTPR repeat-containing protein 
Protein accessionYP_497319 
Protein GI87200062 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCGCTGC GCCAGGCGCT GCGACAGGGT GTCCCGCTGA CCGAGGTGCG CGCCGAGCTT 
GGCGATGCGC TTCTTCTGCA GGGCAACCGC CAGGAAGCGC GCAAGGTGCT TTACGGAGGC
GCCTTTCCAT CTGGCACCGA AGCGCGCGGA TGGCGCCTGA AAGGGCGTCT GGAGCTTGTC
GAGGGCAATC TCGGCGCTGC CGGATACGCC TTCGACCAGG CATTGCGATT GGCGCCTGAC
GAGTCGTCGT TGTGGATAGA CATTGCCCGT TTGCGCTTCA TGGGCGGCGA GGAGGCGCAA
GCGATCGAAG CCGCCGATCG CGCGGTGCGA CTGGCGCCGC GCGATCCCCG CAGCCTCGAA
TTGCGCGGGT TGCTGGTGCG CGAACAGTTT GGCCTGCGTG CAGCGCTGCC ATGGTTCGAG
GCGGGGCTGG CGGCGGCCCC TGACGACACC GGCCTGCTGG GCGAATATGC CGCCACGCTT
GGCGACCTTG GCCAGTATCG CGCGATGCTG GTCGTGTGCC GCAAGCTGGC AAAGGTAGAT
CCGGGCAACC TGCGCGCGCT CTACCTTCAG GCCGTACTGG CAGCGCGGGC AGGGCGCATC
GATCTTGCCC GCAAGATCAT GCAGCAGACC GGCACGGCGT TTCGCGATGT CCCGGCTGCG
ATGCTGCTCA ACGGATTGCT CGAGTATCAG GCCGGGAACG CCAATCTCGC GGTCGGGTAC
TTCGACCGCC TGGTGCGGGC GCAGCCCGAC AACCTCCAGG CGCGGACACT GCTGGCAAGG
GCGCTGGAAC GCGAAGGATT GAATCAGCAG GCCCTCGATG TCGCAGGCCA GTGGGCGCAA
TCCGCATCCG CGTCGCGCTA TCTTCTTATG GTCACGGCGG ATGCGCTGTC CGGGCTCAGG
CGAAAGCGCG AAGGCGAACA ATTGCGAGGG CGCGCTGCGC GAGCGGAGCC AGTGCCTGCG
ACGGTCATCC CGACAGGACA GCCCCTTGGC GCCCTGGCCA TTGGCTATGG ACAGTCGCCG
AACCTTGCCG CCACCGCCGT GCCCTACATT CGCGGGTTGA TCGAGGCGGG AAGTGCGGGC
GAAGCGGTTG CCGTTGCAGA CCGGCTGCGT CAGGCCAGCC CCGGCGCGGC GGGAGCCTGG
CTTCTTGCCG GCGATGCGCG CCTCATGTCC GGGGATTTCG CCGAGGCGCA GGAAATGTAT
GGGCGCGCTG CCGTCATTCG TTTCAACCTT CCGACGCTTC AGCGCATCGA CCTGGTTCTG
AGGCGTCAGG GCAAGGCTGC CGAGGCCAAT GCACTCGTCG CGCGCTATCT CTGGCAGAAT
CCCGGCAGTC CGCAGGCGAT GAAGCTGCTT TCGGCGGGGC GCGCCGAACT GGGCGATGCG
GCGGGCGCCG CCATGATCGA GGCGGTGCTG CGCGCCAGGG GCCTGCGCAA TCCGTCATGA
 
Protein sequence
MPLRQALRQG VPLTEVRAEL GDALLLQGNR QEARKVLYGG AFPSGTEARG WRLKGRLELV 
EGNLGAAGYA FDQALRLAPD ESSLWIDIAR LRFMGGEEAQ AIEAADRAVR LAPRDPRSLE
LRGLLVREQF GLRAALPWFE AGLAAAPDDT GLLGEYAATL GDLGQYRAML VVCRKLAKVD
PGNLRALYLQ AVLAARAGRI DLARKIMQQT GTAFRDVPAA MLLNGLLEYQ AGNANLAVGY
FDRLVRAQPD NLQARTLLAR ALEREGLNQQ ALDVAGQWAQ SASASRYLLM VTADALSGLR
RKREGEQLRG RAARAEPVPA TVIPTGQPLG ALAIGYGQSP NLAATAVPYI RGLIEAGSAG
EAVAVADRLR QASPGAAGAW LLAGDARLMS GDFAEAQEMY GRAAVIRFNL PTLQRIDLVL
RRQGKAAEAN ALVARYLWQN PGSPQAMKLL SAGRAELGDA AGAAMIEAVL RARGLRNPS