Gene Saro_2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2120 
Symbol 
ID3918783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2258634 
End bp2260151 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content69% 
IMG OID640444873 
ProductTPR repeat-containing protein 
Protein accessionYP_497393 
Protein GI87200136 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATC TGCGTGGGTT GATCGTGCCG CTGGTGTTGC CGTTCGTGCT CGCCGGATGC 
GGGGCGAGCC CCGAGGAGCG GGCCGAGCGA GCACGCAAGG CATTCGAGAC GCATGATTTC
CGTGCGGCGC AGGTTGATAT TGCGGCGGCG CTGGAGGCGA AGCCCGGCGA TGCCGCGCTG
ATCGAATTGC AGGCGCGCAA TGCGCTGGCG CTGGGTGACG GCATCGCGGC CGAGGCTGCG
CTTTCCCGCC TGGCGGAAGG GCAGCGACCG GCTGACTTCG CGCAACTGAT GGGCGAGGCG
GCGCTGCTGC GGCAGATGCC CGACGAGGCG CTTTCCGCTA TCGGCAACGA CGCATCGCCC
TCCGCACAAC GTATCCGCGC GCTGGCCATG CTCGCCAAGG GCGATCGCGC GACGGCGGAG
GCGGCGTTCG CGGCGGGGGC GCAAGGGGCA AGCGATGCAC GCCTGCTGGC GGACTACGCG
CGGTTCAGCC TGATGGGGGG AGACGTGGCA AAGGCGCGAG CGCTTGCCGA CCGGGCGGTC
AAGGCTGCCC CGGATCTCAT CGATACGCTG CTGGCGGACG CCGAAGTATC GGTCGCGCAG
GGCAAGCTGG CGCAGGCCCT GGCGACCTAT GACAGGGCGG CGAAGGACTG GCCGGGCAAT
CTTGCCGCGC TGGCCGGGAA GGCGGCGGTG CTGGGCGACC TGGGACGGAC GAAAGATATG
GAGGCGGTCC TCGCCTCGCT GGCCGAAGTG AAGGGCGGCG GGCAGGTCGC CTATCTCCAG
GCGCGCGCGG CTGCGGCGCG GGGGGATTGG AGCACCGTGC GCAGCGTGCT TCAGGCCAAC
GAGAAGGCTC TGGAAGGCAA GGACGAGGCG ACCGTGCTCT ATGCCCAGGC GCTGGTGGCG
CTGAAGCAGC CGGAGCAGGC GCGCGCCCGG CTCCAGCCAC TGCTGACGCG CAATCCGCAA
AGCGCGATGA TACGGCGCGA ATTGGCCAAG GCCCAGCTCG CCGCAGGCGA TGCGCGCGGC
GCGGTCGAGA CGATGCGGCC GTTTGCCGAA GTGCAGACCG CCGATGCGGA AGACTTGCGC
CTGCTGGCAA GGGCCGCGGC GGCTTCAGGC GACCCGGAAG CGGCGAAGCT GGCCGAGAAG
GCGAAGTATC CTTCACCCCA GGCACTGGCG GCGACCTTGG CGCAGGCCGA TACGGCGATG
AAGCAGGGCA ACTGGGGCAA TGCCGTTGCC GCCTACGATC GCATCCTGGC GGTGACGGAT
GGTTCGAACG CGCTGGTGCT GAACAACATG GCCTATGCGC AGGGGCAATT GGGCAACAGT
GCCAAGGCGC TGGACTTCGC GGAACGTGCG CTGAAAGCGG CGCCGGGCAA TGCCTCGGTC
ATGGACACAC TGGGCTGGCT GCTGGTCGAG AGCGGCAAGG ACAAGGCGCG CGGGCTGAAG
CTGTTGCAGG ATGCGGCGGC CAAGGCGCCG GGCAATGCAG CGATCCGCCA GCACCTCGAC
AAGGCGCGGC AGGGCTAG
 
Protein sequence
MKNLRGLIVP LVLPFVLAGC GASPEERAER ARKAFETHDF RAAQVDIAAA LEAKPGDAAL 
IELQARNALA LGDGIAAEAA LSRLAEGQRP ADFAQLMGEA ALLRQMPDEA LSAIGNDASP
SAQRIRALAM LAKGDRATAE AAFAAGAQGA SDARLLADYA RFSLMGGDVA KARALADRAV
KAAPDLIDTL LADAEVSVAQ GKLAQALATY DRAAKDWPGN LAALAGKAAV LGDLGRTKDM
EAVLASLAEV KGGGQVAYLQ ARAAAARGDW STVRSVLQAN EKALEGKDEA TVLYAQALVA
LKQPEQARAR LQPLLTRNPQ SAMIRRELAK AQLAAGDARG AVETMRPFAE VQTADAEDLR
LLARAAAASG DPEAAKLAEK AKYPSPQALA ATLAQADTAM KQGNWGNAVA AYDRILAVTD
GSNALVLNNM AYAQGQLGNS AKALDFAERA LKAAPGNASV MDTLGWLLVE SGKDKARGLK
LLQDAAAKAP GNAAIRQHLD KARQG