Gene Saro_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3066 
Symbol 
ID3916680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3284809 
End bp3286107 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content70% 
IMG OID640445848 
ProductSel1 repeat-containing protein 
Protein accessionYP_498335 
Protein GI87201078 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCAGCAG GACGGCGCGG CGCAGGACGG CGCGGCGCAG GACGGCGCGG CAGGGGGCAG 
CGCGGCAGGG GGCAGCGCGG CAGGGGGCAG CGCGAGCGCT CCGTGCCAGC ACGGATCAGG
AAACGGACTT TGGTGGCAAT GGCTTCGATC GGGTTGAAAA CACGCGGCGG CGGGGCCTGC
AGGCGACTTG CGCGATACCG GGTGCTGACC CTCGCGCTGG CAGCGATGGC CGTCGCCGCA
ACGCCTGCCC GCGCCGATGT GAAGGCTGGC GTCGACGCGT GGTCGCGCGG CGATCACGCC
GGCGCGGTCA AGGAATGGCT CGGCCCGGCA GCCAGGGGCG ATGCCGATGC GCAGTTCAAC
ATGGGTCAGG CCTACAAGCT CGGGAAGGGC GTGACGCAGG ACCTGAAGCG CGCGGAAGCG
TGGTATCGCA AGGCGGCCGA ACAGGGGCAC ATCAAGGCGG GAGACACGCT CGGCCTCCTG
CTCTTCCAGG AAAACCGCAA GGCCGAAGCC CTGCCCTACC TGACGGCCTC GGCCTACCGG
GGGGAGCCGC GCGCGATGTA CATTCTGGGC ATCGCCCACT TCAACGGGGA CACTGTCGGC
AAGGACTGGG TGCGCGCCTA TGCGTTGATG AGCCGCTCGG CCGCGACCGG GCTCGACCAG
GCGACGCGCG GATTGGCCAC GATGGACGAG ATCATCCCGC TCGACCAACG CCAGTTGGCG
ATGTCGCTGG CCACGGAGCT GGAACAGAAG GCGCAGGCGA ACCGGGCAAG GGAATTTGCC
GCAGCCGATC TTGGCGTGAA AGCAGGTGCG CCCGCGCCAA TGCGCCCGCA GCAGGCACCT
GCTCCGCTCC AGCGCGCGGA ACTTCCGCCG TCAACGCCTT CGGTCGCCGC TCCCGTCACG
GCGGGCGCCG ACTTTGCCGA TCCGGTTCCG ATACCGACGC CCCGTCGCGT TGCCGCGAGC
CAGGCCAAGC CGGATGCCCC GCGCGAAGCC GCGCCACCCG CCGCAAGGGC AAAGCCCGCC
GCGCCCACGC AACCCAGGAA GGCCGCACCC TCCGCTTCAG CACCCAAGGC AGACGGCAAC
TGGCGCATCC AGTTCGGCGC ATTCGGAGTG AAGAGCAACG CCGACGCCCT GTGGGCGAAA
GTGAGGAATC GCGCCGAAGT CGCAGGGCAT GCCCGGATCG ATCTGCCCGC AGGCGGCGTA
TCGCGTCTTC TGGCGGGCGG CTACACCGAG AGCCAGGCCG ACAAGGCCTG CGCCGCGCTC
AAGGCTGGCG GCTTCAGTTG CCTGGTGGTA AAGCCCTGA
 
Protein sequence
MAAGRRGAGR RGAGRRGRGQ RGRGQRGRGQ RERSVPARIR KRTLVAMASI GLKTRGGGAC 
RRLARYRVLT LALAAMAVAA TPARADVKAG VDAWSRGDHA GAVKEWLGPA ARGDADAQFN
MGQAYKLGKG VTQDLKRAEA WYRKAAEQGH IKAGDTLGLL LFQENRKAEA LPYLTASAYR
GEPRAMYILG IAHFNGDTVG KDWVRAYALM SRSAATGLDQ ATRGLATMDE IIPLDQRQLA
MSLATELEQK AQANRAREFA AADLGVKAGA PAPMRPQQAP APLQRAELPP STPSVAAPVT
AGADFADPVP IPTPRRVAAS QAKPDAPREA APPAARAKPA APTQPRKAAP SASAPKADGN
WRIQFGAFGV KSNADALWAK VRNRAEVAGH ARIDLPAGGV SRLLAGGYTE SQADKACAAL
KAGGFSCLVV KP