Gene Saro_2356 details

Gene Information

Locus tag: Saro_2356
Symbol:
ID: 3915701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2502950
End bp: 2504809
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 67%
IMG OID: 640445111
Product: hypothetical protein
Protein accession: YP_497626
Protein GI: 87200369
COG category: [S] Function unknown
COG ID: [COG4805] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
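
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record: start/end positions on replicon NC_007794.
START_BP = 2502950
END_BP = 2504809


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1860
print(protein_length(length_bp))  # 619
```

Under these assumptions the results agree with the listed gene length (1860 bp) and protein length (619 aa).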


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.805345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCACTGA CGCGTCGCGT TTCGACCTGC GCGATCGCAC TGGCGATCAT CCTTGCCGGG 
TGCGCACCGG CGACGGCCCG GACAGGCGCT TCGGCCGACA CGATACCGAC TGCCCGGATC
GACGCGGCAC CGCCTCCCCT GCCCGTCACT TCGGCCAACG TGGCGCTGGA CCGCCTGTTC
GCGGAAGATG CGCAAGTCTC GATGCAACTC GACCCGCTCG GCTCGCTGGA GCAGGGACAC
AAGGTCCCGG TCGAGCGGTT CGTGCTGCTG TTCACCCCGG AACTCATCCG CGAACGGCGC
GAGGCGAATG CCCGGTCCCT GGCCGAGCTG GCCAGGATCG ATCCGGCAAA GCTCGATCGC
AACCGCCGCA TCTCGCGCGC AGTATTCGAG GATGCAAAGC GGAACGAACA GGCGCTTCTC
GCACCCGACG TCCAGCCACT TTTCGCGGCG CAGCCCTTCA ACCACTTCGG CGGCTTCCAT
GTCGCCTATC CGGAGCTTTC CGCACCGGGC AGCGGCATCG CACTCGACAC GGTCGAGGAT
TATCGGCTGC TCATTGCGCG GCACAAGGCA CTGCCGCAGG TCTTCGGCCA GGCCATCGCC
CGGTTCCGGG AAGGCATGGC CAGCGGGGTT ACCGAGCCTC GGCTGACAGT CGACAACATG
ATCGTGCAGA TCGACGCGCT GCTGGCCCAG CCGGTAGACC GTTCGCCATT CCTGGCTCCT
GCGCGGCAGT TTCCGGACGA TGTGCCGGCA GCCGAACGCG CCAGGCTGGC GCGGGAGCTG
GCGACGGTGG CGCGGCGCGA GATCTATCCG GCCTATCGGA CGCTGCGCCG CTTCCTAGCC
AACGAATACC GGCCCGTGGC GCGCGAGCAG GTCGGCCTTT CGGCGCTTCC AGACGGCGAA
CGGCTATACC GACTGCTGGC GCGGCAGCAT ACCACTGTGG ACCTCGACCC GGCGGCGGTG
CACGAACTGG GTCTATCTGA GGTGGCACGC ATCCAGTCCG AAATGGAGGA CGTGAAGCGC
CAGCTCGGTT TCCAGGGCCC GCTGCGCAGC TTCTTCGACC ACATCCGGAC CGACCCGAAG
TACCACCCGC ACACGGAGCG GGAACTGGCG GAGGGGTTCC GCGCCGTGGG CCGCAAGGTC
GACGCGCTGG CGCCGCAGTA CTTCCTGCAC CTGCCCCTCA CACCGCTGCT GATACAGCCC
TACCCCGCCT ACCGCGCACG GTTCGAGGCA GGCGGCAGCT ATGCGCAGGG ATCGGCTGAC
GGGAAGCAAC CCGGCGTATT CTTCTACAAT ACCTATGACC TGAAGAGCCG CTTCCTGACC
GGCGTTACCA CGCTCTATCT CCATGAAGGC GCGCCGGGGC ATCACTTCCA GATCAGCCTG
GCGCAGGAAA ACGCGAACCT CCCGGACTTC CAGCGCTTTG GCGGCAACAC GGCCTATATC
GAAGGCTGGG CGCTCTATGC GGAGACGCTG GGCTACGAGA TGGGGTTCTA CAAGGACCCG
ATGCAGCACT GGGGCACGCT CGACGACGAA ATGCTGCGCG CGATGCGGCT CGTCGTGGAC
ACCGGCCTTC ACACCAGGGG ATGGAGCCGA GAAGAAGCGG TCGATTACAT GCTGGCCAAT
TCCGGCATGG GCCGCACCGA TGCGCAGGCC GAAGTCGACC GCTACATCGC CAACCCGGGT
CAGGCGCTGG CCTACAAGAT CGGAGCGCTG ACGATCCAGC GCCTGCGCCG GGAAGCGGAG
GCGGCACTGG GCCGGCGCTT CGACATCCGC CAGTTCCACG ACCAGATTCT GGGGAGCGGC
GCGCTGCCGA TGCCGGTTCT CGAGGCCAAG GTGCGGGGTT GGATCGCCGC CACGCGTTGA
 
Protein sequence
MALTRRVSTC AIALAIILAG CAPATARTGA SADTIPTARI DAAPPPLPVT SANVALDRLF 
AEDAQVSMQL DPLGSLEQGH KVPVERFVLL FTPELIRERR EANARSLAEL ARIDPAKLDR
NRRISRAVFE DAKRNEQALL APDVQPLFAA QPFNHFGGFH VAYPELSAPG SGIALDTVED
YRLLIARHKA LPQVFGQAIA RFREGMASGV TEPRLTVDNM IVQIDALLAQ PVDRSPFLAP
ARQFPDDVPA AERARLAREL ATVARREIYP AYRTLRRFLA NEYRPVAREQ VGLSALPDGE
RLYRLLARQH TTVDLDPAAV HELGLSEVAR IQSEMEDVKR QLGFQGPLRS FFDHIRTDPK
YHPHTERELA EGFRAVGRKV DALAPQYFLH LPLTPLLIQP YPAYRARFEA GGSYAQGSAD
GKQPGVFFYN TYDLKSRFLT GVTTLYLHEG APGHHFQISL AQENANLPDF QRFGGNTAYI
EGWALYAETL GYEMGFYKDP MQHWGTLDDE MLRAMRLVVD TGLHTRGWSR EEAVDYMLAN
SGMGRTDAQA EVDRYIANPG QALAYKIGAL TIQRLRREAE AALGRRFDIR QFHDQILGSG
ALPMPVLEAK VRGWIAATR
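
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11 (the bacterial code), where GTG is one of several alternative initiation codons. The following is a minimal sketch of that translation, not the database's own pipeline. The codon table is built by hand from the NCBI base ordering (T, C, A, G); `translate` and its `is_cds_start` parameter are hypothetical names introduced for illustration. Only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; at the start of a CDS they read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate DNA (spaces and line breaks ignored) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon: end of the protein
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("GTGGCACTGA CGCGTCGCGT TTCGACCTGC"))  # MALTRRVSTC

# Last 60 bases of the gene sequence (in frame, ending in the TGA stop codon).
tail = "GCGCTGCCGA TGCCGGTTCT CGAGGCCAAG GTGCGGGGTT GGATCGCCGC CACGCGTTGA"
print(translate(tail, is_cds_start=False))  # ALPMPVLEAKVRGWIAATR
```

Both outputs match the corresponding ends of the protein sequence listed above. Note that the internal GTG in the last line is read as V, since the M substitution applies only to the initiation codon.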