Gene Saro_2175 details


Gene Information

Locus tag: Saro_2175
Symbol:
ID: 3918840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2317040
End bp: 2320144
Gene Length: 3105 bp
Protein Length: 1034 aa
Translation table: 11
GC content: 64%
IMG OID: 640444930
Product: hypothetical protein
Protein accession: YP_497448
Protein GI: 87200191
COG category: [R] General function prediction only
COG ID: [COG1483] Predicted ATPase (AAA+ superfamily)
TIGRFAM ID:
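
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the start and end coordinates are inclusive and that the protein length excludes the terminal stop codon; the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2317040
end_bp = 2320144
gene_length_bp = 3105
protein_length_aa = 1034

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not counted
# in the reported protein length.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(span % 3 == 0)                                 # True
print(expected_protein_length == protein_length_aa)  # True
```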


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0743489
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTATGG GTTACAATAT TCCTTTGGTC ACCTCTCTCT GCGACATCAG CCCGGACGTG 
TTCTCGATGA GCCGATCCGA GCAAGTGGAG CACCTCTCAT CGATTGGCGA CGCTGATATT
TCGACTGCGC GTGCCTTCTA CGCTCGCAAC CACGTTACAA GCGGAATGTC CGAGTTCCTG
CGTGGCGCTA TGCGACGCCT GTCCGGTCAA AGCCAACAGG CGGTCTTCGA GCTTCGTCAG
GCGATGGGCG GCGGTAAGAC CCACAACATG ATCGCCCTCG GCCTCTTGGC TCGGTTTCCG
GAGTTAAAGG ATCAACTTCC AGACACGATC ACGGCCGGGA TGGGTGACGA GCCTGCTGTA
ATCGCCACCG TCAACGGGCG GGACGTTCAG AACTTCATTT GGGGGGACAT CGCCGAGCAG
CTTGGTCGGG CGGAGGCGTT CCGGGACCAC TGGGTCAACG GGCCGAAGGA GATGAACGAG
GGGGATTGGA TCCGGCTCAT AGGGGACCGG CCCACCCTGA TCATGCTCGA CGAGCTCCCC
CCGTACCTCG CCATGGCGCA CACAAAAACG GTCGGTCAGG GGACCCTGCT GGACCTTCTA
AAATACTCAA TCGCAAACCT TTTCTCCGCC GCGATGAAGC TCAAACGCTG CGTGGTCGTG
GTCGCCTCCC TCGACGCCGC CTATGACGAG GCGCGCCGGA TCTTAGGCGG GCAGCTCGCG
GACCTCCAGA AGGAGACGAG CCGCGGCGCG AAGTCGATCA CCCCGGTCGA CCTGAACACG
GGCGAGATCT ACGACATCCT GCGCAAACGC CTCTTCACCC GACTCCCGGA CCCGGACGGG
GCAGAGGTGG ACAGAGTGGC GCAGGCCTAT CTTGCCGCCT ACCAGGAGGC GATTCGGGGC
CGGGCGCTAG CGAAATCTGC CGAGCAGATG GCTGACGAGA TCGTCGCCTC CTACCCGTTC
CACCCGAGCT ACAAGGACAT TCTCTCCCTT TTCAAGGAGA ACGAGAAGTT CCGTCAGACC
CGGGGCTTAA TCCAGTTCAC CGCGAACCTG ATGCGCGGGG TCTGGGGGAG AAAGGACCAA
GAGGAAGTGT TCCTGATCGG GGCGCAGTTC CTCGACTTCT CCGACCAGGA GACCCGAGAT
CAGGTAAAGG AGATCGAGCG GTCGCTGGAG TCCGCTCTAG CCAGCGATGT TTATGACACG
GACGGATCCG CGCACGCTCA GGGCATCGAC GGGGACCGGA ACGACCGCGC CGCGTCGCAG
GTGGCCACCC TCCTGTTCAT CTCGTCGCTG TCGGACAACA CAGACGGGAT CCGGGGCCTG
CCCCGCGACA CAGTCGTGGA GTACCTGGTC TCCCCCGGCC AAGAGCCGAC GCGGTTCATC
GAGGCGTTCG ACCAGCTTCG GGACCGTTGC TGGTACCTGC ATAATCGGGA CGGGGATCGG
TGGTACTTCT CCGACATCGC CAACGTCCGC AAGCAGATCG AGGACAAGGT CGGAAAGATT
CCGCAGGACC GGGTCGACGA GGAGATGCGT CGCCGCCTCA CGGACATCTT CCGTCCCGTC
AACAAGCTCG CCTACTCTGA GCTGGTGGTC CTTCCGCGGG TCGATGAAGT GAACCTGACC
CCGTCTAAGC GGACCTGTCT GGTCCTCTCC CCCGACGCGA AGTCCCCGCC CGCTGCGGCA
GCGCGGTTCT TCGACGACGT GGTTTACAAG AACGCGTTCT GCGTGGTCGC CGGGGACGGG
TCGAAGATGG CCAGCGCGGA GGACAGCGTC CGTCGCCTCC TTGCGATCGC TGCCGTGAAG
ACGATCGTGG CGGACACCCC CCGGCACCAG CGGGAAATTG AGACGGAGCA GGAAACGACC
GAGATTGGGT TCAACTCCAC CGTCAAGAGC CTCTTCAACG CGGTTTGGTA CCCGCAGACG
AAGGAGCTGA AGAGCGCCCG GATCGACCTC GGCCACTTCC AGGAGCGGGG AGTGATCCAA
GGGGAGAAGG CGGTTGAGGC GGCTCTCGCG GGAGGAGGGG CAAAGAAGCT CGTCGAGCTG
GACCCGGAGA AAACGGACGG GCTGATCCAG CGCTGCGAAG ACCAGCTGTT CCCGGAGAAC
CAGTCCCGCA CCCGCTGGTC GGACGTGCTG GAGCGGGCCG CGTCGAACCC TCGCTGGATA
TGGCTCCCCC CGAAAGGAAT GGAGGAGATC AAGGCAGCCG CCCTCGCGGA GGGCCGCTGG
GTCGAGGAGA ACGGCTACGT AGACAAGAGC CCGCCCCCAC CACAGCCACT GATCAGGGTC
ACTCGGATCG GCGGGGACGA GGCGACCGGG GAGAGCGAAC TGGAGCTCGC CGTCTCGAAC
GCCGGGCGGG CACCCGAAGT GCTGGTTGCA CCGACCCGGG AGGGGCTGGA CGCCGGTGAG
ATCATCACGG ACCGAACCTA TCGGACCACG GAGGTGGAAC TCTGGTTCCA GGCCCGTAAC
CCGGAAACAG GGGAGGTGAG CGAGCCGTAC CGCTGGGCCG GGTCTATTAC GATCACTCAT
GAGCGGCGTG ATAACGCGGG CATGTGGCAG GTGACCCTCG CAGCGCGTCC CGAGGCGGAG
CTGCGCTGGA ACACCTTGGG GATCAACCCG AAGGACGGGT CGCTCTATAA TGGCGGAGCG
ATAGAGATCG ACGGGAGGCA AAAGACCACT CTCTACGTCT ACGCGGTCAA GGGGGGCGTG
TCCGCGCAGC GGACCTTCGC CTTCGACGCG GTCGGCGCAC AGCGAACCAT CAACAACGAG
CGCCCCGCCA AGGCGAAACG TGACTTCCAG TTCGCCTCCA AGGGGGAGGT TCTCCGGGTC
GTGCGGGCCG CGAAGGGGAA GGAGAGCGTC CGGTTCCACG GGGTCAGCGT CACTGTTGGT
GAGGGGGAGC GGAGCCTCCG GGTCCGCAGC GGCGGGGACG TTGCCCTGTC GGGTGCGGAC
ATCGAGACGA TGATCGACGG GCTGCGCGGT GCGCTCGGGC AGCCGGACGC GGAGGTGCAA
CTCCGGTTTC GGGAGGCGGA CTTCCCTGAT GGGTATGCCC TGAAGGACTT CGCGACCCAA
GTCGGGATCG ACATCCCGGT TGAGGACGTG GAGCAGGAGG CCTGA
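
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any statistic such as the GC content field can be recomputed from it. The following is a minimal sketch, assuming the sequence is pasted in as plain text; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGAGTATGG GTTACAATAT TCCTTTGGTC ACCTCTCTCT GCGACATCAG CCCGGACGTG"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Running `gc_percent` on the full cleaned gene sequence gives a value that can be compared with the GC content field in the record.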
 
Protein sequence
MSMGYNIPLV TSLCDISPDV FSMSRSEQVE HLSSIGDADI STARAFYARN HVTSGMSEFL 
RGAMRRLSGQ SQQAVFELRQ AMGGGKTHNM IALGLLARFP ELKDQLPDTI TAGMGDEPAV
IATVNGRDVQ NFIWGDIAEQ LGRAEAFRDH WVNGPKEMNE GDWIRLIGDR PTLIMLDELP
PYLAMAHTKT VGQGTLLDLL KYSIANLFSA AMKLKRCVVV VASLDAAYDE ARRILGGQLA
DLQKETSRGA KSITPVDLNT GEIYDILRKR LFTRLPDPDG AEVDRVAQAY LAAYQEAIRG
RALAKSAEQM ADEIVASYPF HPSYKDILSL FKENEKFRQT RGLIQFTANL MRGVWGRKDQ
EEVFLIGAQF LDFSDQETRD QVKEIERSLE SALASDVYDT DGSAHAQGID GDRNDRAASQ
VATLLFISSL SDNTDGIRGL PRDTVVEYLV SPGQEPTRFI EAFDQLRDRC WYLHNRDGDR
WYFSDIANVR KQIEDKVGKI PQDRVDEEMR RRLTDIFRPV NKLAYSELVV LPRVDEVNLT
PSKRTCLVLS PDAKSPPAAA ARFFDDVVYK NAFCVVAGDG SKMASAEDSV RRLLAIAAVK
TIVADTPRHQ REIETEQETT EIGFNSTVKS LFNAVWYPQT KELKSARIDL GHFQERGVIQ
GEKAVEAALA GGGAKKLVEL DPEKTDGLIQ RCEDQLFPEN QSRTRWSDVL ERAASNPRWI
WLPPKGMEEI KAAALAEGRW VEENGYVDKS PPPPQPLIRV TRIGGDEATG ESELELAVSN
AGRAPEVLVA PTREGLDAGE IITDRTYRTT EVELWFQARN PETGEVSEPY RWAGSITITH
ERRDNAGMWQ VTLAARPEAE LRWNTLGINP KDGSLYNGGA IEIDGRQKTT LYVYAVKGGV
SAQRTFAFDA VGAQRTINNE RPAKAKRDFQ FASKGEVLRV VRAAKGKESV RFHGVSVTVG
EGERSLRVRS GGDVALSGAD IETMIDGLRG ALGQPDAEVQ LRFREADFPD GYALKDFATQ
VGIDIPVEDV EQEA
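
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is illustrative and not part of the original record. It builds the standard codon table, whose amino acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG). The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text: str) -> str:
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAGTATGG GTTACAATAT TCCTTTGGTC ACCTCTCTCT GCGACATCAG CCCGGACGTG"
print(translate(first_line))  # MSMGYNIPLVTSLCDISPDV
```

The output matches the first two blocks of the protein sequence above. Applied to the full gene sequence, the same function would be expected to yield a 1034-residue protein if the record is internally consistent.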