Gene Saro_0302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0302 
Symbol 
ID3916239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp327631 
End bp328644 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content67% 
IMG OID640443031 
ProductTPR repeat-containing protein 
Protein accessionYP_495584 
Protein GI87198327 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4235] Cytochrome c biogenesis factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.642139 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGC CCCAGTCCAG TCCGGACCCG AAGCCCAATC CGGAAGCGAG CGCCGAGGCG 
CAGGGGGCTT CGCGCCTTGG CCGCGCGGCG CTGATCGCCG CCGGCGTGAT CGCGCTTGGG
GCAGGCGGCT ACGCCATGAT CGGTCGACAC GATTCGCCGC CGCCTCCGGT CGAGCCGCCT
CCCGCCGCTC CCAGCCAGCA GCCCTCGGTC GACGACGTGA TCGCGAAGCT GGAAAAGAAG
CTGGCGGAAA ATCCGGATGA TGCCGAAGGC TGGCGCATGC TCGGCTGGTC CTATTTCCAG
ACCGAGCGCT ATGCCGAGGC GGCGACCGCA CTGAAGAAGG CAACCAAGCT CGATCCCGAA
CATGCCGAGA CCTGGTCGTT CCTGGGCGAG GCGCTGGTCC TTGCCAGCAA GGAAGAAGGC
CGCATGCCGC GCGATGCCAA GGCGGCATTC GACAAGGCGA TCAAGCTCGA TCCCAAGGAT
GCCCGCGCCC GCTACTTCCA GGCCGTCGCG CTAGACCTTT CGGGCCGGCA CCGCCAGGCG
ATCAACGCCT GGTTCAAGCT TCTCGAGGAT ACGCCCGCCG ACGCACCCTA TGCCGAGGAC
ATCCGCGAGG TGATCCGCAA CGTGGGCGAA CGGCGCAAGA TCGATGTCGA GAAGCGTCTC
GCCGAAGCGC ATTTCGCGGC GCCGGCCAAC GGCGTGATCA CCGATGGCCC GCACAAGGCG
GCGGCGGCCA TCCCCGGTCC CACCAGCGCA GAGATGAAGG CGGCGGCCGG CCTGCCCAAG
GGGCAGCAGG AAGCCATGAT CCGCGGCATG GTCGACGGGC TCGAGGCAAA GCTCGAGAAG
AATCCCGCCA ATGTCGACGG CTGGATCATG CTCATGCGAA GCCGCATGCA GCTTGGCGAG
CCGCGCAAGG CTGCCGAATC GCTGCAGAAG GCCCTTGCCG CGTTCCGCAA CGATGGCGCC
GCCTCCCGCA AATTGCGCGA AGCGGCATCC AGCATCGGCA TTTCCGGCGC CTGA
 
Protein sequence
MTEPQSSPDP KPNPEASAEA QGASRLGRAA LIAAGVIALG AGGYAMIGRH DSPPPPVEPP 
PAAPSQQPSV DDVIAKLEKK LAENPDDAEG WRMLGWSYFQ TERYAEAATA LKKATKLDPE
HAETWSFLGE ALVLASKEEG RMPRDAKAAF DKAIKLDPKD ARARYFQAVA LDLSGRHRQA
INAWFKLLED TPADAPYAED IREVIRNVGE RRKIDVEKRL AEAHFAAPAN GVITDGPHKA
AAAIPGPTSA EMKAAAGLPK GQQEAMIRGM VDGLEAKLEK NPANVDGWIM LMRSRMQLGE
PRKAAESLQK ALAAFRNDGA ASRKLREAAS SIGISGA