Gene Saro_3314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3314 
Symbol 
ID3915961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3536777 
End bp3538483 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content69% 
IMG OID640446099 
Productheat shock protein Hsp70 
Protein accessionYP_498583 
Protein GI87201326 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCATCG GGATCGACCT GGGAACCACC AACAGCGCGG TGGCCTTGTG GCATGAGGGC 
GAGGCACGGC TCGTGCCCAA TTCACTGGGA GCGCTGCTGA CCCCGTCGGC GGTTTCGGTG
CTGCAGGACG GGACGACGCT TGTCGGCGCG GCGGCATTCG AACGCATGGC CGCGAAGGAT
GGCGCGGCGG CTACCAGCTT CAAGCGCCTG CTGGGCACCG ACCGCAAGGT GCGCCTTGGC
CGCAAGGAGT TCTCGGCGGA AGATCTCTCG GCGCTGGTCC TGCAGTCACT GTGCGCCGAC
GTCGAGGCGC ATACCGGCGA GCGGCCGACC GAGGCGGTGA TCACGGTCCC CGCCTATTTC
AACGACCGCC AGCGCAAGGC GACGCGCCGC GCAGGCGAAC TGGCGGGCCT CTCCGTGCGG
CGCCTGATCA ACGAACCAAC CGCAGCCGCG CTGGCATTCG GCCTGAAGGA CAAGGCGGAG
CGTGAACCCT TTCTGGTCTT CGATCTCGGT GGCGGGACGT TCGATGTCTC CATCGTCGAG
ATGTTCGAAG GCATCGTCGA GGTCCGCGCC TCGGCGGGCG ACAACCGGCT TGGCGGCGAT
GATTTCAACG GTGCGCTGGC GATGGCGGTC AAGGGGCGGC TCGATCCCGA CGAAAGACTT
GCGACCCTGG GCGAGGCCCG CGCGCAGGCG CTGCTGCTGC AGGCGGCCGA ACGGACCAGG
CGTGCCCTGA CCGATGCACC GGAGGCCGAG TTCGCCGTGA CCGCCGGCGA CGAACGGCTT
TCGACCACCG TCACCGCCAG CGAGTTCGAG GCGCAGGCCG AGGGCCTGCT GCGCCGGTTG
CGCGACCCGG TCGTGCGGGC CCTGCGCGAC AGCCAGATCG ATGCGGCGTC CCTCAGCGAG
ATCGTGCTTG TCGGCGGGGC GACGCGCATG CCGCTGGTGC GCAAAGCGAT AACCCGGTTA
TTCGGCCGCT TCCCCAATGC GTCCGTCCAC CCGGACCACG CGGTCGCCCT TGGCGCCGCG
ATCCAGGGCG GGCTGATCGC GCGCGATGGC GGGCTGGAGG AAATCCGGAT CACCGATGTC
TGCCCCTTCA CGCTCGGCAT CGAGACGGCG GAACACTCTG TGCGGGGCAC GATCCAGCAG
GGCCTGTTCT CGCCGATCAT CGAGCGCAAT ACCCCGGTGC CGGTCAGCCG CTCGGGCGTC
TACAGCACCA TGGGCGACGG GCAGAAGCAG ATCGCGGTGC ACATCTACCA GGGCGAGGCG
CGCGAGGTTT CGGGCAATGT CAAACTCGGG ACGCTGTCCG TCCCGGTGCC GTCCCGGCCC
GCTGGCGAGG TGTCGATAGA CGTGCGCTTT TCCTATGATA GCTCCGGCCT GCTCGAAGTG
GACGTTGAGG TGCCGCTGAC CGGGACGAGG CACAACCTCG TCATCATCGA CGAGGAAGAC
CGCAAGGCCG CGAAGGATCT CGATGCCCGT CGCAAGGCGC TGGCCGCGCT CAAGCACCAC
CCGCGAGAGG AGGCCGCGAA CCAGCTCCTG CTCGCGCGGG CCGAACGCTG CTACGAGGAA
TTCCTCGGCG ACGTGCGCGC GGTGATCGGC GGGCGCACGC TTTCGTTCAC CACCGCGCTC
GACAGCCAGG ACCCGCGCCG CATCGCCGAC GCCGCGGCCG AGCTTGCCGA ACTGCTTGAC
ATGCTGGAAG CGAACCCGGT CCTGTGA
 
Protein sequence
MLIGIDLGTT NSAVALWHEG EARLVPNSLG ALLTPSAVSV LQDGTTLVGA AAFERMAAKD 
GAAATSFKRL LGTDRKVRLG RKEFSAEDLS ALVLQSLCAD VEAHTGERPT EAVITVPAYF
NDRQRKATRR AGELAGLSVR RLINEPTAAA LAFGLKDKAE REPFLVFDLG GGTFDVSIVE
MFEGIVEVRA SAGDNRLGGD DFNGALAMAV KGRLDPDERL ATLGEARAQA LLLQAAERTR
RALTDAPEAE FAVTAGDERL STTVTASEFE AQAEGLLRRL RDPVVRALRD SQIDAASLSE
IVLVGGATRM PLVRKAITRL FGRFPNASVH PDHAVALGAA IQGGLIARDG GLEEIRITDV
CPFTLGIETA EHSVRGTIQQ GLFSPIIERN TPVPVSRSGV YSTMGDGQKQ IAVHIYQGEA
REVSGNVKLG TLSVPVPSRP AGEVSIDVRF SYDSSGLLEV DVEVPLTGTR HNLVIIDEED
RKAAKDLDAR RKALAALKHH PREEAANQLL LARAERCYEE FLGDVRAVIG GRTLSFTTAL
DSQDPRRIAD AAAELAELLD MLEANPVL