Gene Saro_1961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1961 
Symbol 
ID3917277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2079602 
End bp2080741 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content64% 
IMG OID640444709 
ProductErfK/YbiS/YcfS/YnhG 
Protein accessionYP_497235 
Protein GI87199978 
COG category[S] Function unknown 
COG ID[COG1376] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.975073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACGCC ACACAGCTGC CGCGCTCGCC ATCCTAGCCC TTGCCGCTTG CCAATCGGAG 
AAGTCGGACC CCGCTCCCGA TAAAAGCGCG GCAGCGCAGC CAGCGAAGCC AGCGTTCGTC
GTGCCCATGG CTTCGAGCGA TCCTGCGCCC GGCCAGCCCG CACCGGCCGA GGACATGCCG
CGGCCGGTGA TGCAGGCACA GGTCGTACTC GAACGGCTTG GCTTTGCGCC CGGGATCATC
GACGGCAAGG AAGGACTGAG CACTCGAAAC GCGGTCCAAG GGTTCCAGGA AGCCAATGGC
ATTGCCGTCT CTGGCAATTT CGATCGCGCC ACGATGCAGG CGCTCGCCCG CTGGTCCAAC
ATACCCGCCA CCCGCCTGGT CACCATCCCG GACGATTTCG CGCACGGTCC GTTCGCGCCG
TTGCCAAAGG AACCTGCAGC CCAGGCGAAG CTGAAGGCTT TGGGCTACGC CTCGCTCGAG
GAAAAGCTCG CCGAGCGCTT TCATACTACG CCTGAAGTGC TGCGCGCTCT CAACGCACCT
CCGGTGCAGC AGCCGATTGC GTCAAGCGCT GCGGTGGAGG CAACGAGGCG AACGCCGGTG
ATCTATCGTG CGGGCCAGCA GATCAGGGTT CCGAATGTCG GCGCCGATGC CATCGATCCT
GTGGCGGTCG GGGACCAGGG CGCACTCGAA ACCATGGCTT CGCTCGGCGT CGGTTCGAAC
CAGCCCAAGG CGGGGCGCAT CGTTGTGAGT GAACACGCGG GGACGCTGAA AGCCTTCGAC
GCATCGAACA AGCTTGTCGC GCTTTTCACC GTCACCACCG GTTCCGAACA CGATCCGCTG
CCGTTGGGGA ACTGGAAGAT CTACTCTTCC TCGTTCAATC CTCATTTCCG TTATGACGCC
AGCCTGTTCT GGGACGTCCC CGACAGCAAG GGCGAGCATC TCCTGCCGCC GGGGCCCAAT
GGACCGGTGG GTGTAGTGTG GATCGACCTG TCGAAGGAGC ACTATGGCAT ACATGGCACC
CCGGAGCCGC AGACCATCGG CCGGACGGAA AGCCACGGCT GCGTCCGCCT CACCAATTGG
GATGCCGCGC GGCTCGCGCT CATGGTCGAC GGCGCGACGA AGGTTTCATT CGTAAGGTGA
 
Protein sequence
MLRHTAAALA ILALAACQSE KSDPAPDKSA AAQPAKPAFV VPMASSDPAP GQPAPAEDMP 
RPVMQAQVVL ERLGFAPGII DGKEGLSTRN AVQGFQEANG IAVSGNFDRA TMQALARWSN
IPATRLVTIP DDFAHGPFAP LPKEPAAQAK LKALGYASLE EKLAERFHTT PEVLRALNAP
PVQQPIASSA AVEATRRTPV IYRAGQQIRV PNVGADAIDP VAVGDQGALE TMASLGVGSN
QPKAGRIVVS EHAGTLKAFD ASNKLVALFT VTTGSEHDPL PLGNWKIYSS SFNPHFRYDA
SLFWDVPDSK GEHLLPPGPN GPVGVVWIDL SKEHYGIHGT PEPQTIGRTE SHGCVRLTNW
DAARLALMVD GATKVSFVR