Gene Saro_1919 details


Gene Information

Locus tag: Saro_1919
Symbol:
ID: 3917142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2032081
End bp: 2033382
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 65%
IMG OID: 640444665
Product: hypothetical protein
Protein accession: YP_497193
Protein GI: 87199936
COG category:
COG ID:
TIGRFAM ID:
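
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2032081
end_bp = 2033382
gene_length_bp = 1302
protein_length_aa = 433


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Both checks print True for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the record is internally consistent: the coordinate span gives 1302 bp, and 1302 / 3 = 434 codons, one of which is the stop codon, leaving 433 amino acids.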


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0683862
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGTTA CTTTCGGTCG CATGACTGGC GGCCTGATCG GCAAGACGGC AATGGCCCTC 
GTCCTTGCAG CTGGCGGCCT GGCCGTCGGC GCAACCGGCG CGGTTGCCAA GGAAAAGGAA
CAGAAGGTCG CCAAGGCGAC GAATTCGCCT GAATTCGGCA AGGCCGCGCA GACGCTGCAA
AAGCCCATCG CCGACGTAAC CGCCAGCAAG GACAAGGCTG CGGCGCAGGC GCTGATCCCC
CAGCTTGCGG CCATCGAAGC GTCGGTCAAG ACCCCACTGG ACCGCATCAT CTACGGTCAG
TGGCAGCAGC AGATCGGCGC CGCCGCAGGT GACAGTACGC TGCAGCAGAA GGGCCTCCAG
AACATGGTGG ATAGCGGCCA GCTTGGCGAC AAGGCTACGC TGGTTGCCTA CTACCTTGGC
ATGACCGCCT ATCAGAACAA GGACTATGCA ACCGCGTCGA AGGTGCTCGG CCCCCTCGTT
GCGGCCAACT ACAACGACGA TACCGCGGCT GAAGTCCTTG CCGACTCGTT CGCCCAGCAG
GGTCAGGCCC CGCAGGCTCT CGAGGCGCTG AAGGGCGCCG TGGCGGCCCG CAAGGCTGCA
AACGGCACCG TGCCCGAAGG CTGGTTCAAG CGCGCCAACC TCATCGCCTA CAAGAACAAG
CTGGCGCCCC AGGCTATCGA ATGGTCGACC ATGATGGTCG AGAACGACCC GACCCCGCTG
AACTGGCTTG GCGCTGGCCA GTTGGTTCGC GAGTTCGGCC AGTTCACCAG CCAGGAATCG
CTTGACCTCG GCCGTCTTCT GCTTCGCGCC GGCGGCTTCC AGAACGACCC CAAGTATGTC
GAGCGCGAAT ATGTCGAGTA CATCGAATCC GCCGACCCGC GTCGTCTCCC GGGCGAAGTC
CTGAAGGTCG CGGACAAGGG TGTGAAGGCC GGCGTCCTCA AGGCGAACGA TCCGTTCGTG
CTTGACGCGA TGACGCAGGC CAAGGGCCGT ATCGCTGCCG ACAAGGCCTC GCTGCCTGCA
CTCGACCGCG AAGCTCGTGC CGGCAAGGAC GGCAAGAGCG CGCTCGCCAT GGCAGACGCC
TACCTCTCGT ACGACGAAGC GCCCAAGGCC GAGGAAATGT ACAAGATGGC GCTGACCAAG
GGCGGTATCG ACAAGGACCG CGCCCTGACC CGTCTGGGCA TTGCCCAGAT CGACCAGAGC
AAGTTCGAGG ACGCCAAGGC CACCTTCGCG CAGGTTGGTG GCACGCGCGC TCCGCTCGCC
CGCCTGTGGC TGGCTTTCGC GAACACGCAG GCCCGCCCGT AA
 
Protein sequence
MRVTFGRMTG GLIGKTAMAL VLAAGGLAVG ATGAVAKEKE QKVAKATNSP EFGKAAQTLQ 
KPIADVTASK DKAAAQALIP QLAAIEASVK TPLDRIIYGQ WQQQIGAAAG DSTLQQKGLQ
NMVDSGQLGD KATLVAYYLG MTAYQNKDYA TASKVLGPLV AANYNDDTAA EVLADSFAQQ
GQAPQALEAL KGAVAARKAA NGTVPEGWFK RANLIAYKNK LAPQAIEWST MMVENDPTPL
NWLGAGQLVR EFGQFTSQES LDLGRLLLRA GGFQNDPKYV EREYVEYIES ADPRRLPGEV
LKVADKGVKA GVLKANDPFV LDAMTQAKGR IAADKASLPA LDREARAGKD GKSALAMADA
YLSYDEAPKA EEMYKMALTK GGIDKDRALT RLGIAQIDQS KFEDAKATFA QVGGTRAPLA
RLWLAFANTQ ARP
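
The sequences above are printed in blocks of ten characters, so the spaces and line breaks must be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. The helper names are hypothetical. The codon table is built from the standard NCBI amino acid ordering, which translation table 11 shares with table 1 for internal codons; alternative start codons are not handled, which is acceptable here because this gene starts with ATG.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGCGTGTTA CTTTCGGTCG CATGACTGGC"
print(translate(clean_sequence(first_blocks)))  # MRVTFGRMTG
```

The printed peptide matches the first ten residues of the protein sequence above. Passing the full gene sequence through the same functions would allow the listed protein length and GC content to be compared against the sequence itself.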