Gene Saro_1341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1341 
Symbol 
ID3917791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1382689 
End bp1383666 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content67% 
IMG OID640444079 
Productaminodeoxychorismate lyase 
Protein accessionYP_496619 
Protein GI87199362 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCGCC GCCGCCGATC CAGATTGCCG CTGCTGGCGG CGGCCCTTGT CGTGCTGGTC 
GTGGCGGCGT GGCTCGTCGG CGGCTGGTTC TCGTCTGGCC CGCTGGAAAA GCAGCTCGAA
TTCGACGTGG GCGAAGGCGA GGGGCTGAGC GCGCTTTCGG ACGATCTGGA GGCGCAGGGC
GCCATCGGTT CGGCCACGCT GTTCAAGCTG CGCGCACGGC TGCTGGGCGG CGGCACCGAA
ATCAAGACCG GTTCGTTCCT GATCCCCAAG CGCGCGAGCG AAGCTACGAT CCTTGAAATC
CTCAAGGGCG ACAAGGTCAT CCGCCGCCTG ATCACCATCC CCGAAGGCAT GCCGTCGATC
ATGGTGGCCG AGCGTCTGCG CGCCGTGGAT GGCCTGACCG GCGATGTCGC GGTGCCCGAG
GAAGGTTCGG TGCTGCCCGA CAGCTACGAC TGGCAGAAGG GTGAAAGCCG CGCCGCCGTG
GTCAAGCGGA TGCAGGCGGC AATGGACAAG ACCCTGGCCG AACTCTGGGC AAAGCGATCG
CCGCGCACGG TCGCCAAGAC GCCGCAGGAG GCGCTGGTGC TGGCATCGAT CGTCGAGAAG
GAAACGGGCA AGCCCGAGGA GCGGCGCATG GTTGCCGGCC TCTACTCCAA TCGCCTGCGC
CAGCGCATGC TGCTTCAGGC CGACCCGACG ATCATCTATC CGATCACCGG GGGCAAGCCG
CTCGGCCGCC GCATCCGCCA GTCCGAGATC CAGGCGGTGA ACGGCTACAA CACCTATACG
ATGATCGGCC TGCCCAAGGG CCCGATCACC AATCCGGGGC GCGATTCCAT CGCGGCGGTG
CTCGACCCGG CGGAGACCGA TGCGCTGTTC ATGGTGGCCG ACGGTACCGG CGGGCACGTT
TTCGCGAGCA CGCTGCAGGA ACACAATGCC AATGTTGCCA AGTGGTTCGC CATCCGCAAG
GCTCGCGGCG AGATCTGA
 
Protein sequence
MARRRRSRLP LLAAALVVLV VAAWLVGGWF SSGPLEKQLE FDVGEGEGLS ALSDDLEAQG 
AIGSATLFKL RARLLGGGTE IKTGSFLIPK RASEATILEI LKGDKVIRRL ITIPEGMPSI
MVAERLRAVD GLTGDVAVPE EGSVLPDSYD WQKGESRAAV VKRMQAAMDK TLAELWAKRS
PRTVAKTPQE ALVLASIVEK ETGKPEERRM VAGLYSNRLR QRMLLQADPT IIYPITGGKP
LGRRIRQSEI QAVNGYNTYT MIGLPKGPIT NPGRDSIAAV LDPAETDALF MVADGTGGHV
FASTLQEHNA NVAKWFAIRK ARGEI