Gene Saro_2736 details


Gene Information

Locus tag: Saro_2736
Symbol:
ID: 3916895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2960755
End bp: 2963025
Gene Length: 2271 bp
Protein Length: 756 aa
Translation table: 11
GC content: 65%
IMG OID: 640445514
Product: hypothetical protein
Protein accession: YP_498006
Protein GI: 87200749
COG category:
COG ID:
TIGRFAM ID:
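
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2960755
END_BP = 2963025


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2271
print(protein_length(length_bp))  # 756
```

Both results agree with the Gene Length and Protein Length fields listed above.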


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGAGA TCGATCCCGT CATTCTGCAG CTTCAGGCGG ACCTGAAGCA GTATCGCTCT 
GACCTCACCG GCGCCCAGAG GCTCACCGAA ACGAAGCTCG CCGCCATCGA GGCTCGCGGC
GTCGCGATGG GGCAGAACAT CCGCAAGGGC TTTGACCTCG CGAAGGGCGC GGCGATTGGC
TTCCTCGCAA CGGTCAGTGT CGACGCGCTG ACACAGGCGG CCAAGCGCGG CCTCGATTAT
GCATCCAGCC TCGGGGAAGT TGCGCAGCAA CTCGGCGTAA CCACGGACGC GCTTCAGGAA
TATCGCTACG CCGCGTCGCA GGCTGGTCTT TCCCAGGAGG AAATGGACCA GGCGCTGTCG
CAGCTCACCC GCCGCATTGG CGAGGCAGCG AGCGGGACAA AGGCGCAGGC CGAGGCCTTC
ACGAAGCTCG GCATCTCGGT CAAGGACGCG AACGGGAACG TCATGGACGC CGGTCGGGCG
ATCCCCATGA TCGCCGATGC GCTGCAAAAG ATCGAGAGCC CGGCCGAGCG CGCCGCGATC
CTCATGGACT TGTTCGGACG CGCCGGCCAG AAGCTCGAAC CGCTGCTTTC GGGTGGTTCG
GCGGCTGTGA ACGAGTTGCG CGACGCGGCG CACAAGCTCG GCATCGTCCT GTCGGAAGAC
CAGATTCAAC GGGCGGACGA GACGGCGGAC AAGCTTTCCG CTCTCAAGCA AGTCCTTGAG
GCGCGTATCG CGGGGGCAGT CTCGGACAAC GCCAGTGCGA TCCTCTCGCT TGCCAACGCG
CTGGCCAGCG TCGTTGACTG GGCGGGCAAG GCCGCAGACG CCTACCGCCG GTTCAAGCTC
GAACAGGGGC TGCGGGAATC GCAGGCGATG CAGACGGGCT GGTTCCGCTC CGATGCTGAC
CGCGCCAAGG GCCAGCGGGA CGAGCAGCTG TATCGCTACG AAATTGCCAA GATGGACGGC
AAGGTCGACA CGACCGGCGG CTTCCGGGAC TACCGCATCA CCGGGATCGG AGGCGCCAGC
GCAACCCCAG CACCCGGGGC TGTCGCATCT GCGGCGACAA CTAAGAAGAC GAAGGCAGCT
ACTGCCGGAC CTTCAGGCCC ATCTGCCGCC GAGATCATGG CCCGCATCGA CAGCCAGTTG
GCGTCTATGG CGCAGCAGGC CCTGTCCGCG ATGGAGAGCG TCGCCAAGTC CGCCGATGAG
CGTGCGGAAC TTGAACTGCG CAGCGTCGAG CTGGCGCGCG TTCGGGCCTT GCGAGAGGTT
GACACTGACA CGGACCTTGA CCGGCTCGGC AAAGAGGGAG CGGCGAACCA GCGTGCGCGC
CTCAAGACGC AGATCGAGGC GTTGGCCGAT GCCGAACGCG ACCGCATCGA GCAGCGCCGG
AAGGCGGAAC TCGAACAAGA CGCCCGCGAC CTCGCCCAGG AACGCTACAG CACCGATCGC
GACGGATTGC AGATCCAGTA CGATCTCGCG GACAGTCAGA CCGAGCGGAA ACGCCTTGCG
CTCGAAATGC TCGACCTCGA GCTGCGCTAT CAGAAGGCGC TGCTCGAAGG CGTGATCGCT
TCGGAAACTG CGACCGAGGC AGAGAAAAAG CGCGCTCAGG CGGCACTCGA CGGTCTTAAT
GCAACAGCAT CCGGCAAGCG CGAGGCGGCC TCGCGGTCCA ACGAGACGCC GCTGGAAGCA
TATCGTCGGA AGCTCGATCG CAGTCCGGAC GCGATCAACG AGCAAGTCGA ATCCTACGTC
GTCGAAGAAC TCGACAACGT CCGCGACGGC ATCCGCGGTG CGCTGGAAAA GGCGATCGGC
ACCGACGATC CGCTGATTTC TGGCCTGCTG AACCTCTTGA TCGAGCAGGT CATTCTGCGT
CCACTCGCCG AAGCTCTGGC GAGCGCGTCC GGCGGGGGCG GCGGATTTCT CGGTGCCGTC
GCGTCCGGCA TCGGCTCATT GTTCGGCCGA GCATCGGGCG GATACGTCGC GCCTGGCCAG
ATGGTGCGGG TCAACGAAGG CGCGTCGCCG GGTCGCGTGG AAGGCTTCAT CCCGCAGGGC
GGCGGACACA TCGTGCCGCT GGGCCGCATG AATGCGCTGC GCCAGGCAGG CGGTCAGAAG
GTTTTTCAGA TCAGCATCGA CGCTCGCAAC AGCGTTACCC CCGACGGATT TGCGCGCGAA
CTGTCGAGCC AAATCCTGCG CCAGGCCGCC GCGATGGACG GCCAGACTGC GCAAGCAGTC
CTCAAGGCCG CGCCGGGCCG GATGAGCCAG TACCAGCGGG ACAAAATCTG A
 
Protein sequence
MPEIDPVILQ LQADLKQYRS DLTGAQRLTE TKLAAIEARG VAMGQNIRKG FDLAKGAAIG 
FLATVSVDAL TQAAKRGLDY ASSLGEVAQQ LGVTTDALQE YRYAASQAGL SQEEMDQALS
QLTRRIGEAA SGTKAQAEAF TKLGISVKDA NGNVMDAGRA IPMIADALQK IESPAERAAI
LMDLFGRAGQ KLEPLLSGGS AAVNELRDAA HKLGIVLSED QIQRADETAD KLSALKQVLE
ARIAGAVSDN ASAILSLANA LASVVDWAGK AADAYRRFKL EQGLRESQAM QTGWFRSDAD
RAKGQRDEQL YRYEIAKMDG KVDTTGGFRD YRITGIGGAS ATPAPGAVAS AATTKKTKAA
TAGPSGPSAA EIMARIDSQL ASMAQQALSA MESVAKSADE RAELELRSVE LARVRALREV
DTDTDLDRLG KEGAANQRAR LKTQIEALAD AERDRIEQRR KAELEQDARD LAQERYSTDR
DGLQIQYDLA DSQTERKRLA LEMLDLELRY QKALLEGVIA SETATEAEKK RAQAALDGLN
ATASGKREAA SRSNETPLEA YRRKLDRSPD AINEQVESYV VEELDNVRDG IRGALEKAIG
TDDPLISGLL NLLIEQVILR PLAEALASAS GGGGGFLGAV ASGIGSLFGR ASGGYVAPGQ
MVRVNEGASP GRVEGFIPQG GGHIVPLGRM NALRQAGGQK VFQISIDARN SVTPDGFARE
LSSQILRQAA AMDGQTAQAV LKAAPGRMSQ YQRDKI
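
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a nucleotide sequence. It assumes the codon assignments of NCBI translation table 11, which match the standard genetic code, and it does not handle alternative start codons. All function names are hypothetical, and pure Python is used so that no external library is assumed.

```python
from itertools import product

# Codon assignments in NCBI's compact form (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCCAGAGA TCGATCCCGT CATTCTGCAG"
print(translate(clean_sequence(first_blocks)))  # MPEIDPVILQ
```

The translated fragment matches the first block of the protein sequence listed above. Passing the full cleaned gene sequence to gc_fraction gives a value that can be compared with the GC content field.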