Gene Saro_0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0022 
Symbol 
ID3916064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp17734 
End bp19323 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content61% 
IMG OID640442747 
Producthypothetical protein 
Protein accessionYP_495305 
Protein GI87198048 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.575522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGTCGAA TTTTATTAAG AACAGCCAGC GCCTTGGCGG CGCTTTGCCT CGCCATGCCA 
GTTCAGGCAA AATGGTACGA GGCATCAAGC AAGCACTTCG TGGTCTATTC CGACCGCGGG
ACTGAACAAG TCCGGATAAT GGCCTCCAAG CTCGAGCGCT TTGACGCAGC GCTGCGTCAG
CTTACCGGTC GGACTGACGA ACCTATCGCC CCGTCGAACC GGGTCACCAT TTTCATGGTG
CAGTCGCCGA TCGATGTCGC CCGCCTGTTC GGAAAGGGCG GAAACTACGT CGGTGGCTTT
TACTCCTCGA TCGCAGGCGC GTCATACGCG ATCGTCCCGC CATTCATTGT GCAGGAGACC
GGTGAGAACG GTTTCGCCGA GGGCGTATTG TTGCACGAGT ACGCGCACCA TTTCGTTTCG
GAAAACCAGT CGATCCTCTA CCCGATATGG CTGAACGAGG GATTCGCCGA GTTCGTCTCA
ACCGCACGCT TCGAGAAGAA CGGTTCGATC GGCCTCGGGC TGCCAGGGCA GTGGCGCGGC
TGGCAATTGA CGCACCAGGT CTCCGTGCCC GCCGAAATGC TGGTCGACAG CCAGGCCTAT
TTCGCGCGGC GCGAGTTGGC GTTCGACCAG TTCTACCCCC GCGCGTGGCT GCTTTATCAC
ATGCTGACGT TCGAGGCCTC GCGGGAAGGG CAACTCGCCG CATATGCAGG TGCGCTCAGC
CGCGGGCTGA ACGACCGGGA TGCCGCCATC GCGGCATTCG GCGATTTGCG TCAGCTCGAA
CTAGATCTTC AATCGTACGG CCGGAAGGTC ACGATGCCCT ACTACCTCAT AAAGGGGGAA
ACCTTGCGCC CCGCTCAGGT GAACGTACGG GAGCTGAGCA AGGAAGATGC CGAAGCGATG
CCCTTCTGGA TGCGCATTCG CCGAGGCTTG GAAGAAGGGA ACGCCGAACC GCTGGCGGTT
GAAGTTCGGG CAATGGCCGC GCGCAACCCC ACGAGTGCCT TCGTGCAGAC CATCCTCGGG
ATGGCGGAGT TCGAAGCCGA TCATGTCGAT GCTGCACTGG CCGCTGCCGA CACGGCAATC
GCAATCGACC CGATGGCGAT CGAGGCTCAC GTCCTCAAAG GGCGCGCGAT GCTGGCCAAA
GCGTCCCGGA ACGGCGCGAC GCCCGTCGAA TGGAACGCTG TGCGCGCGGC ATTTCTCAAG
GCGAACGCGA TCGACCCTGA TCACCCGCAA CCTCTGTTCC TGTACTACGC ATCGTTCTTG
AGTCAGGGGT TCAAGCCCAC CGGTAACGCC TTGCAGGCTC TCTCCCGCGC GCTGGAGCTC
GCGCCATACG ACCGTGTAAT CCGCGCGGAA CTGGCGGAGA GTCAAGTTTC TCGCGCCCAA
TTCGAGGAGG CGAAGTCGAC GATACGCATG CTCATGCGTG ATCCGCATTC TCCACTGGGC
GGCGCCCGCA TCCGCGCGGT CATGGAAAAG CTCGACGAGC GCAACGCCGA AGGTGCTCGC
AAACTTCTCG AGATGAGCAA CGAGGAGTTC AAGGCAAAGC AGGAGAAGGG TGACCCAGAC
AGTGGCGCCG GTCGCGAGGC GGCTGCCTGA
 
Protein sequence
MRRILLRTAS ALAALCLAMP VQAKWYEASS KHFVVYSDRG TEQVRIMASK LERFDAALRQ 
LTGRTDEPIA PSNRVTIFMV QSPIDVARLF GKGGNYVGGF YSSIAGASYA IVPPFIVQET
GENGFAEGVL LHEYAHHFVS ENQSILYPIW LNEGFAEFVS TARFEKNGSI GLGLPGQWRG
WQLTHQVSVP AEMLVDSQAY FARRELAFDQ FYPRAWLLYH MLTFEASREG QLAAYAGALS
RGLNDRDAAI AAFGDLRQLE LDLQSYGRKV TMPYYLIKGE TLRPAQVNVR ELSKEDAEAM
PFWMRIRRGL EEGNAEPLAV EVRAMAARNP TSAFVQTILG MAEFEADHVD AALAAADTAI
AIDPMAIEAH VLKGRAMLAK ASRNGATPVE WNAVRAAFLK ANAIDPDHPQ PLFLYYASFL
SQGFKPTGNA LQALSRALEL APYDRVIRAE LAESQVSRAQ FEEAKSTIRM LMRDPHSPLG
GARIRAVMEK LDERNAEGAR KLLEMSNEEF KAKQEKGDPD SGAGREAAA