Gene Saro_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2053 
Symbol 
ID3917700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2194213 
End bp2195349 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content66% 
IMG OID640444805 
Productchaperone protein DnaJ 
Protein accessionYP_497326 
Protein GI87200069 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID[TIGR02349] chaperone protein DnaJ 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.519265 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGCCG AGATCGATTT CTACGAACTG CTCGAAGTGG AGCGCACCGC GGACGACAAG 
GTGCTGAAGT CCGCTTATCG CAAGCTGGCG ATGAAGTACC ACCCGGACAA GAATCCCGGG
TGCGCGGATT CCGAAGCCAG GTTCAAGCAG ATCAACGAGG CCTATGCCTG CCTCTCCGAC
CCGCAGAAGC GGGCGGCCTA CGATCGCTAT GGCCACGCCG CGTTCCAGCA GGGCGCCGGT
GGAGGAGCCG GGGGCGGTGG CTTCGGCGGG GGCGACTTCG GCGACATCGG CGACATTTTC
GAGACGATCT TCGGTGGCGC CTTCGGCGGC GGCGGCCGGC AGCAGCCGCG CCGTGGGGCC
GACCTGCGCT ACGACATGGA AATCAGCCTG GAAGACGCCT TCCACGGCAA GGATGCCGAA
ATCGAGGTCG AGGTTTCGCA GAAGTGCGAT CCCTGTGGTG GATCGGGCGC GACGCCCGGC
ACGTCCGCGC GGCGCTGCAA CCTTTGTGGC GGCCACGGCA AGGTCCGTGC CCAGCAGGGC
TTCTTCATGG TCGAGCGGCC GTGTCCCAAT TGCCACGGGC GCGGCGAAGT GATCGAAAAG
CCTTGCAAGC ATTGCCGTGG CGAGGGCCGG GTCGATGCCG AGCAGCGATT GCAGGTCACG
ATCCCGGCCG GGGTGGACAA TGGCACCCGC ATCCGCCTTG CCGGAAAGGG CGAGGCGGGG
CCGTTCGGTG CGCCTCCGGG CGACCTCTAC ATCTTCCTCC ACATCCAGCG GCACCGGGTG
TTCGAGCGCG AAGGCACGAC GCTGCTGACC CGTTGCCCGA TCAGCTTCAC CACTGCGGCG
CTTGGCGGCG AGATCGAGAT TCCGGGCCTC GACGGCAAGG TCCACGCCAT CGAGATTCCC
GCCGGCATCC AGTCCGGCAA GCAGTTGCGC AAGCGTGGCG CGGGCATGCC CGTGCTTCAG
GGACGCGGAA TTGGCGATCT CGTGGTGGAG ATCCACGTCG AGACGCCGAC CAAGCTGTCC
GCGCGCCAGA AGGAACTGCT GCGCGAGTTC CAGGCGACCG AGACCGGCGA GGAATGCCCC
CAGTCGAAGG GCTTCTTCGA ACGGATCAAG GACGCCTGGA CCGACCTTAC CGAGTAA
 
Protein sequence
MSAEIDFYEL LEVERTADDK VLKSAYRKLA MKYHPDKNPG CADSEARFKQ INEAYACLSD 
PQKRAAYDRY GHAAFQQGAG GGAGGGGFGG GDFGDIGDIF ETIFGGAFGG GGRQQPRRGA
DLRYDMEISL EDAFHGKDAE IEVEVSQKCD PCGGSGATPG TSARRCNLCG GHGKVRAQQG
FFMVERPCPN CHGRGEVIEK PCKHCRGEGR VDAEQRLQVT IPAGVDNGTR IRLAGKGEAG
PFGAPPGDLY IFLHIQRHRV FEREGTTLLT RCPISFTTAA LGGEIEIPGL DGKVHAIEIP
AGIQSGKQLR KRGAGMPVLQ GRGIGDLVVE IHVETPTKLS ARQKELLREF QATETGEECP
QSKGFFERIK DAWTDLTE