Gene Saro_3998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3998 
Symbol 
ID5077528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp165669 
End bp167783 
Gene Length2115 bp 
Protein Length704 aa 
Translation table11 
GC content65% 
IMG OID640481103 
Productmating pair stabilisation TraN 
Protein accessionYP_001165765 
Protein GI146275604 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGCGC CGCGCCTTGC TGGATTTGCG CTTGCCGGGA TGGTGCTGCT CGCCGCGGCC 
GGCGCACAAG GACAGGTCTA CATCCCCCCG CCCGACGATC TGGTCGAGCC GCAACCGCCG
CTCCCGCCCG CGATTGATCC CGGCGTGCCC GCTCCCGCCC CGCCGCCTCC TTCCGCGCCT
GCCACCATGA CCGTGGACGA GGCCAAGGCC GAGGCTCGGG CGACCGGATC GAGCGTGCGC
GGCGCCTACC AGGGGATCAC CGATGCCCCG GGTGCGGCCG GGCAGATACC GGGGTATCAG
GCGGAATATC CCGGACTCAC CCAGTATTAC GACAATCCCG GCAACATGTA CGCCGATGGG
GCGGCGGCCG GCGTGAATAG CAATGCCTAC CGCACCGCCA ATTCGACCAC GCGGCCGACC
GTCGATGTCA CCCGCGCCGA CCTGTCCCGC GCCAACACCG TGACCGACGA TCCCAATGCC
TACTTGTCTG GCATGAGCGC GGACGGGTCG ACCGGCAATT GCGTCCCGCT GCCGCCGAGT
CCGGGGACCA CCAACACTGC CGAATGGACC TGCAATGTCG GCTCGAGCGT GGTCGAACAG
CCCAAGACCT GCACCCGCAG TCTCACCGTC GCGCCATGGA ACGAGACGCT CTACCAGTAC
CTCTGCGTCA CGGCCCCTGG CTTTCCGGGC TGCGCCTCGC TTGAGGGCAA CGCCCTGTGC
CGCAAGACCG GGACCTTCCC TGTTCCCGAC TATAACCTCA CGGTCGACTA CTACGACTGC
GACGCAGGCG TCAGTGATCC CAACGTCTAC CTCATGGGCA CGGTGGCCAA ACCGCCGCCG
GCGGACGCCT TCCAGGTCGT GAGCAACGTC TATCGCTGCA ACAACGAAGG GATCACCGAT
GCGCTGACCT TTGATCCCGT GACGGGTTTT CCCGTCCAAT ATGTGAGCGG GCTTCAGCAG
TGCGGGGCGA TATCGGCGGA GCCGAGCTGT ACCCAAACCA CCGCCTCCGC GGCAGGCCTC
ACCGACCGCC AGCTCTGCAA GACCTGGGAT TTCATCGGCG ATCCGTTCGG GGGAGGGGGC
TACCTCACTT GCCTTGAGCC CGCTTCGCTA GAAGCGGTCT ATTCGTGCAG CACCAATGTC
GCCGGCATCG TGCCCGAAAG CTCGGTATCG AAGTGGTTCA CCCAAGTCTG GACCGACAAT
GCCTGCTCGG TCGATCTTGG CACCTGCACC CTCGCTGCCG AGACCTGCAC TGCCCCCAAT
GAAACCCGCC TGATCGATGG CGTGCCGGTC ACGCGGGCCT GCTGGGAGAC GGCGAAGACC
TACCAGTGCC AGACCGTGGT GGGCGGCGGC AACGACTGCG GCAAGCTCGA TGCAACCCCC
GGCTGCATGT TCGACCACGA GACCTGTCTC GACGATCCGC CCAGCGGAGA TGGTTCGTGC
AAGGTCGCCG AGCGTGTCTA CAAGTGCCCG ATCCCGGGAT CGACGAGCCA GCCTGCGCAG
TACATCTGCG GGGACGACGT CTACTGCGTC AATGGCGACT GCGAGCCGAT CGTTCGCGAG
GCCTCCGACG AGTTCAAGGA CGCGGTCGTG GCGCTGAACG CGCTCGGACA GGCCAATTCC
GAGTTCGACG AAAGCACGCT GACGCTGTTC AAGGGCACCG CCGAGAGTTG TGCGCACAAG
GTCTTCGGTC TCGCCAACTG CTGCTCGGGG AAGGGCGTCC CGCTCCTCGT TCCGCTGCTG
TGCAGCCCGG CCGAGGTGCT GCTCGATCAG AAGGACGATG CAGGGCTCTG CCACAAGATT
GGCACTTACT GCTCGTCGAG CTTCCTTGGC ATCTGTCTTT CCAAGCGCGA TGTCTATTGC
TGCTTCCTCT CGAAGATCAG TCGGATCCTC CAGGAGCAGG GCCGGCCCCA GATCGGCAAG
ACATGGGGAA CGCCGAAGAA GCCCGTCTGC GACGGCTTCA CCATTTTCGA GTTCCAGCAG
CTCGACCTCT CGGTGATGGA TTTTTCCGAA ATCTACGCGG AGTTCGTCGA TGCGGCGAAG
CTCCCCGACG AAGCCGCCAC GCTCATCGAA ATCCAGGCAA AGATCGAGGC CTATTATGCC
GCGCACAAGC CCTGA
 
Protein sequence
MRAPRLAGFA LAGMVLLAAA GAQGQVYIPP PDDLVEPQPP LPPAIDPGVP APAPPPPSAP 
ATMTVDEAKA EARATGSSVR GAYQGITDAP GAAGQIPGYQ AEYPGLTQYY DNPGNMYADG
AAAGVNSNAY RTANSTTRPT VDVTRADLSR ANTVTDDPNA YLSGMSADGS TGNCVPLPPS
PGTTNTAEWT CNVGSSVVEQ PKTCTRSLTV APWNETLYQY LCVTAPGFPG CASLEGNALC
RKTGTFPVPD YNLTVDYYDC DAGVSDPNVY LMGTVAKPPP ADAFQVVSNV YRCNNEGITD
ALTFDPVTGF PVQYVSGLQQ CGAISAEPSC TQTTASAAGL TDRQLCKTWD FIGDPFGGGG
YLTCLEPASL EAVYSCSTNV AGIVPESSVS KWFTQVWTDN ACSVDLGTCT LAAETCTAPN
ETRLIDGVPV TRACWETAKT YQCQTVVGGG NDCGKLDATP GCMFDHETCL DDPPSGDGSC
KVAERVYKCP IPGSTSQPAQ YICGDDVYCV NGDCEPIVRE ASDEFKDAVV ALNALGQANS
EFDESTLTLF KGTAESCAHK VFGLANCCSG KGVPLLVPLL CSPAEVLLDQ KDDAGLCHKI
GTYCSSSFLG ICLSKRDVYC CFLSKISRIL QEQGRPQIGK TWGTPKKPVC DGFTIFEFQQ
LDLSVMDFSE IYAEFVDAAK LPDEAATLIE IQAKIEAYYA AHKP