Gene Saro_1194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1194 
Symbol 
ID3916491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1239972 
End bp1241567 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content71% 
IMG OID640443930 
Producthypothetical protein 
Protein accessionYP_496473 
Protein GI87199216 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCACG CCGCCGATCC GCATCAGCCC TCCACCATTG CCACATCGTC TTTCGGACGC 
GGGTGGATTC CCGCCGGGCG TGACATCGGG CTTCCTGACG ACGACTTCCC ACCATTCAAT
GCGAACGGGC AGGCGGATTG GGCGGGGTGG GTCTCTCATG TAGCGAAGCG CAGTGCAGCA
GAGCCATTAT CGGACTCGGG GCAACGCCCG AGCGCGAGTG CTGGTGCTCC CCTCCCGTTT
GCGGGAAGCG GGATAGAGGG CAGCGCACTC CCGCACCCCA ACGCCTTCAC CCCCGTCCGC
ATCGCCCGCT TCCTCGAAAG CCTCAGCCGG TGCGGGCAGG TGCGCAACGC CGCGCGCGTG
GCGGGGGTGT CGCAGCAGAC GGCCTATGTC CGCCGCCGCC GCGATCCCGC CTTCGCCGCC
GGGTGGGATG CGGCGCTGAT CCTGGCGCGC GAGGCGGCCG AGCAGGTCCT TGCCGAACGC
GCCCTGTGCG GCATCACGGA AACGATCTGG TTCCGGGGCG AGGCGGTGGG CGAGCGGCAG
CGCTTCGACG GGCGCCTCCT CCTCGCCCAT CTCGCCCGCC TCGATGCGCG GGTCGCGGCG
GCGCCCGGTG CCGTCCACCA CCTTGCCGAG GACTTCGACG CCATGCTCGT CGCGCTGGCC
GGGGGCGAGG AGCCGGCAGA GGCTGTAGAC TGGCCCGACC CCGCCCGCGA CGATCATGTC
GAGGCGCGTG CCGATGCCGC GGCAAGCGCC TTCGATCATG CCCATCCCGA ACCGGAAGAC
CCGCTCGACG ATGCCGCCTG GGATGCCTGG CAGTCCGCCC GCGCTGCCGC ATCCGACGCC
GCGCGCCTTG CTGCGCAAGC CGAATGGCGC GCCGCCGCGC AAGGCCGCGA CGCCAGCCTC
GCGACGCTGC TTGATGCCCC GCTCGAACGC AAGGCTGCCG GCACCGTCGT GACGGCGGCT
GCGGCAGAAG TCGGGCCGGG GTCCGGGCCT CAGCTACAGC GCCAGGCGGA ATGCCCCCAG
GACAGTGTAA ACACCGTCAA CCCTGCGCCT GCCGCGCCCG TCCCGATTGC GCGAGGACAA
TTCCCGCCGT CGCCGCACGG GTGTAGGCTG GCTGCCGCGC CGGAGAGTCG TCACGCCTTC
CGGCCTCTCA AGGGAGACAA GATCATGAGA CGTGCAACAT CGACCGTCGC GATTGCCGCA
GCGCTCGCCG CCACCGCGCT TGCCAGCCCG GCGGTCGCCA AGCCGGTCAC GCTCACCGCC
AGCCTTGCCG GTGCGGCCGA GACGGGCGGC GGCGATGCCG ACGGCGTGGG CGGCTTCAAG
GTCGAGGCGG ACGATGATTC CGGCGATTTC TGCTTCACGC TCTGGGCCGA AAAGATCGCG
GCGCCGACCA TGGCCCATGT CCACGAAGGC GCGGCGGGGG CCGACGGCAA GCCCGTCGCC
ACGATCGAGG TCACCGGCAA GGACAGCGAC GCCTGCGTCG CGATGGAGCC CGAACTGATC
AAGAAGATCC TCGCGGCGCC CGGCGACTAC TACGTCAACG TCCACACCGG CGATTTCCCC
AAGGGCGCTA TCCGCGGCCA GCTCCAGAAG CCCTGA
 
Protein sequence
MPHAADPHQP STIATSSFGR GWIPAGRDIG LPDDDFPPFN ANGQADWAGW VSHVAKRSAA 
EPLSDSGQRP SASAGAPLPF AGSGIEGSAL PHPNAFTPVR IARFLESLSR CGQVRNAARV
AGVSQQTAYV RRRRDPAFAA GWDAALILAR EAAEQVLAER ALCGITETIW FRGEAVGERQ
RFDGRLLLAH LARLDARVAA APGAVHHLAE DFDAMLVALA GGEEPAEAVD WPDPARDDHV
EARADAAASA FDHAHPEPED PLDDAAWDAW QSARAAASDA ARLAAQAEWR AAAQGRDASL
ATLLDAPLER KAAGTVVTAA AAEVGPGSGP QLQRQAECPQ DSVNTVNPAP AAPVPIARGQ
FPPSPHGCRL AAAPESRHAF RPLKGDKIMR RATSTVAIAA ALAATALASP AVAKPVTLTA
SLAGAAETGG GDADGVGGFK VEADDDSGDF CFTLWAEKIA APTMAHVHEG AAGADGKPVA
TIEVTGKDSD ACVAMEPELI KKILAAPGDY YVNVHTGDFP KGAIRGQLQK P