Gene Saro_2118 details


Gene Information

Locus tag: Saro_2118
Symbol:
ID: 3917766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2256144
End bp: 2257658
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 71%
IMG OID: 640444871
Product: TPR repeat-containing protein
Protein accession: YP_497391
Protein GI: 87200134
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
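
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2256144, 2257658)
print(length)                     # 1515
print(protein_length_aa(length))  # 504
```

Both results agree with the Gene Length and Protein Length fields listed for Saro_2118.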


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAAACG TCAGGATACT TTCTCTCGCC CTGCTGCTGT CCGCCTGCGG AACCGATACG 
GCCGACCGCG TGGCCCGGGC CCGCTCGGAG ATCGCCGGGA TGGAACTGGC CGCGGCCCGC
GTCGATCTGG CGGCGGCGCT TGCCGAGCGG GGCGACGATG CCGAACTGCT GCGGTTGCTG
GCCTCGGTGC AACTGCGCCT CGGCGACGGG GACGGGGCGG AAGCCACGGC GGCAAGGCTG
GAACGGACCG GGGCAACGGG CGCGGAACTC GCGCGCATGA GAGCTGAAGC CGCACTGCTG
CGCGGCCGTG CACGGGAAGC GCTGGCGCTT CTGGGCAACG ATGCCACGAC TGGCGGCTGG
CGGGTGAGGG CCGCCGCGCT TTCCGCAGTG GGCGACGGGG AGGGAGCATT CAGGGCGCTG
CAATCGGGTC TTGCCGCAGG GTCCGATCCG CTACTGCTGC GCGACATGGC GCGGTTCCTG
ATCGATGCGC AGGACCTTGA TGGCGCACAG CGGCAGGTGG ATGCGCTGGC CCGGATGCAG
GACGATGGTT TCGATGCCCT GATGCTTTCG GCAGACATCG CGGCGCGGCG CGGGCGCTAT
GCCCAGGCCC ATGCCACGCT GGAACGCGCG GCAAAGCGCT ATCCGCGCAT TCCGGACCCG
TGGATCGCCC GGGCCGATGC CTATGATCGA GAGGGCAAGC TCGACGAGGC GGTGGCAATG
ACGGCGCGGG CGGCGGCCCT TGCGCCGGAC GATCCGCGCG TGACCAATCT CAAGGTCGAG
TTCGCCGCGA TGAAGGGCGA CTGGGAGGCG GTCCGCACGG CGCTGGCGCG GCAGGAGGCA
ACGCTCGACC CGCTGTCGGC CAACGGGCTC ACCTATGCCG AGGCGATGCT GCGGCTGGGG
CGGCCGGAGC AGGCGCGGGC GATGTTCCAG CGCGCCCTCA CACGGTCGCC CAACAATCCG
TACTCGCGGC TCATGCTTGC GGAAGCGCAT CTGGCGACGG GCAATGCCGT TGCCGCGCTG
GAAACCGTGC GTCCGCTCGC GCAAAGCCTG ACGGCAGGTC CGCGCGAACT GGAACTTGCC
GAGAAGGCGG CACGGGCGGC CAACAGCGGC GAGGCCGACG CGCTGGCGGC TCGGCTCGCG
GCGGTCCGGA AGTCACAGGT CTCGGCGCTG GCTGCCAAGG GTCAGGCTGC GCTGGTCGGC
GAAGACTGGA ATGGCGCGAT CGAAGCCTAC GGACAACTTG CGCAGATGGG CGAGGATGCG
GACGTGCTGA AGCGGCTGGC GCTGGCGCTG AGCCACGCCG GACGGGTCGA CGAGGCCATC
AGGGCAGCGG ACCGTGCACG GACCCTTCGG CCCGGCGATC CGGACATGAG CTACATGGCC
GGGTATGTGC GCGTCGCGGG CGGGAAGGAC AAGGCCACCG GGCTTGGTCT GCTCCGCCAC
GCGACCGAGT CCGCGCCGGA CAATCTGGTC TTCAAGCGGG CGCTTGCGCG GTATTCGGCG
GCTGGCGGCG CCTGA
 
Protein sequence
MRNVRILSLA LLLSACGTDT ADRVARARSE IAGMELAAAR VDLAAALAER GDDAELLRLL 
ASVQLRLGDG DGAEATAARL ERTGATGAEL ARMRAEAALL RGRAREALAL LGNDATTGGW
RVRAAALSAV GDGEGAFRAL QSGLAAGSDP LLLRDMARFL IDAQDLDGAQ RQVDALARMQ
DDGFDALMLS ADIAARRGRY AQAHATLERA AKRYPRIPDP WIARADAYDR EGKLDEAVAM
TARAAALAPD DPRVTNLKVE FAAMKGDWEA VRTALARQEA TLDPLSANGL TYAEAMLRLG
RPEQARAMFQ RALTRSPNNP YSRLMLAEAH LATGNAVAAL ETVRPLAQSL TAGPRELELA
EKAARAANSG EADALAARLA AVRKSQVSAL AAKGQAALVG EDWNGAIEAY GQLAQMGEDA
DVLKRLALAL SHAGRVDEAI RAADRARTLR PGDPDMSYMA GYVRVAGGKD KATGLGLLRH
ATESAPDNLV FKRALARYSA AGGA
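
The sequences above are shown in groups of ten characters, so the spaces and line breaks need to be removed before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence from this record.
first_row = "ATGCGAAACG TCAGGATACT TTCTCTCGCC CTGCTGCTGT CCGCCTGCGG AACCGATACG"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_fraction(first_row), 2))
```

Applied to the full gene sequence, the cleaned length can be compared against the Gene Length field and the GC fraction against the GC content field.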