Gene Saro_1619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1619 
Symbol 
ID3918727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1688627 
End bp1692235 
Gene Length3609 bp 
Protein Length1202 aa 
Translation table11 
GC content61% 
IMG OID640444359 
Productcadherin 
Protein accessionYP_496893 
Protein GI87199636 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.42147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTACA TTCAGATCGA TGGTTCGCTG TCGGACTGGT CCTCGAACCT GAGGATCGAC 
GCGGGCGCCG TGGACGGCTA CCAGATCTAT GCCACGACCG ATGCGACCGA CTATATCTTC
GCTTTCGCAG CGCCCACGGC GGTTGGTGCG AACACGACGA TCTGGCTCAA CACGGATCTC
AACCAGGCGA CCGGCTACCA GCTCTGGGGG ACTGTTGGCG CCGAGTTCAA CGTCAACTTC
AAGTCCGACG GCTCTGCAGC ACTTTATTCG GGGGCAGCCG GCGGAACGCT TGTCGCGGAC
AACCTCGTCC TTGCCTACAA CGCCGATAAA ACAATGGTCG AGCTGCGCGT TCCGAAGGAT
CTGCTTGGCA ACCCCGGTTC GATCGACACC GTGTACGACA TAAACGACAC GGCGATCATT
CCCAGCTTCT ACCAGGACAA CGCCATCCGG GTCTGGGACG ATTCCGAGCT GGCCAGCGTC
ATCCCGGCGA CAGATACCCG TATTGCGATC GTCTATTCAG CGACGACGGC GGCCAACTAC
TTCAGCCAGA CTGCCTATTC GGACCTGTTC ATGGCCGCCC AGTCGCAGGC GGCGCAGGCT
GGCGTGCCCT TCGACATCAT CACCGAGGCC GACCTTACCG ACATCAACAA GCTCGCCCAG
TACAAGGCGA TCGTCTTCCC CTCGTTCCGC AACGTGCAGG CAAGCCAGGC CGACGAGATC
GCGCACACCC TTCAGCTCGC ATCGCAGGAG TTCCACGTCG GCTTCATCGT GTCCGGCGAG
TTCATGACCA ATGACGAGAA CGGGAATGCC ATGGCCGGCA ATTCCTATTC CCGCATGGCG
ACGCTGCTTG ACGCCACGCG CGTTACGGGT GGCACCGCTA CGTCACTGAC GGTCACGGCC
ACCGACCCGA CCGGCGTCGT CCTTGATGGC TATGCCAACG GGGAACTGGT CAATCAGTAT
GCCAACGTCG GCTGGAACGC TTTCCAGAGC GTGAGCGGCA CCGGCCAGAC CATCGCCACC
GAAACCATAA ATGGTTCGTC GACATACGCT GCGGTTCTCG CCACCCAGAC TGGTGGTCGC
AACGTCCTCT TCTCGAGCGA TGCGGTGATG GCCGACGCCA ACATGCTGCA ACGCGCCATC
GATTATGCCG TGAGCGGCGA GACGGTGACC GTTTCGCTGA ACATGACCCG CGACGCCGGC
CTTGTCGCCG CGCGCGTGGA CATGGACCAG AGCATGTACA TCGAGGACGT CAACGGCGGC
ATCTATGACC AGCTCGTTCC GCTGCTTCAG CAATGGAAGG CGCAATACAA CTTCGTCGGC
TCGTTCTACG TCAACATCGG CGACAACACC CAGCAGGGGA TCTATACCGA CTGGAACAAG
TCGCTGCCGT ACTACACGGC GATGATCGGC CTCGGAAACG AGATCGGCAC GCACACCTAT
ACCCACCCCG AAGACACCAA CCTGCTGAGC CCGTCTCAGT TGCAGTTCGA ATTCGAGCTG
AGCACCCAGA TCCTCGAGCA GAAGCTCAGC GCGGCGCTGG GTTATGCCTA TACCATCGAA
GGTGCCGCGA TCCCCGGCGC GCCGGAAACG CTGACGACTT CGCTGGCCAT CGAACAATAC
GTCAAGACCT ATCTGACTGG CGGCTACACT GGTCAAGGCG CGGGCTATCC CAACGCCTTC
GGCTATCTGA CGCCCGGTAG CCAGGACAAG GTCTATATCG CGCCGAACAC CTTTTTCGAT
TTCACCCTGT TCGACTGGCT GCACCTTTCG GCGGCGGATG CCAGCGCGTT GTGGCAGTCT
CAGTACGAGA AGATCGTCAG CCAGGCCGAC TCTCCCGTCG TCGTCTGGCC ATGGCACGAT
TACGGTGCGA CGGCGTTCAA TTCGCCCAAC TACGCGCCCG AAATCTTCAA CACGTTCCTT
GCCCAGGCCG CAGCCGACGG CATGGAATTC GTGACCCTCG CCGACCTGGC CAATCGCATC
AACGCCTTCC ATGGCGCCAA GGTGACGACT TCGGTCTCAG GCAACACGAT CACCGCCAAT
GTCACCGCTT CCGGCAACGT CGGTACGTTT GCTTTCGATC TGCAGGGGCA GGGCAGCCAG
GTCATTTCGA GCGTGGCCGG TTGGTATGCA TACGACAGCG ACAGCGTGTT CCTGCCCCAG
AATGGCGGCA CGTTCGTCAT CACCCTGGGC GCGGCGCAGA CGGATGTGAC TCACATCATC
GATCTGCCCA TGCGGGCGAC GCTGATGTCG GTGACGGGGA ATGGCACGAA CCTGTCGTTT
CAGATCCAGG GCGAAGGCAC GGTTGTCATC GATCTTTCCG ATCCGACCAA CAAGAGCGTC
CAGGTGTCTG GCGCCACCAT CGTTTCCCAG GTCGGAGACA AGCTTACCAT CGATATCGGG
CCGGTCGGGT CGCACACCGT TACCGTCACC CAGACTTCGC TCAACCACGC GCCGGTTATC
GAGTCCAATG GCGGTGGAGA CACCGCGGCG ATTTCGCTGG CGGAGAACCT CCTCGCAGTG
ACTGCGGTCA TCGCGACCGA CGCTGATGCC AATGCCCTGA CCTACTCGAT CACGGGAGGG
GCGGACGCAT CGAAGTTCAC GATCAATGCG ACGACGGGTG CGCTTGCGTT TCTGGCCGCG
CCGAACTTCG AAGTCCCGAC CGATGTCGGC GGCAACAACG TATACGATGT CGTGGTGACT
GCATCCGACG GAGCGCTCAC CGATAGCCAG GCGCTGGCCG TGACGGTCAC AAACGTCAAC
GAGGCGCCGG TAATCACGTC GAATGGCGGC GGTGCGACCG CCTCTATCTC GCTTGCCGAG
AACAACGCGG CGGTCACCGT GGTGACCTCG ACCGATCCGG AAAACACCGC GCGGACGTAT
TCGCTCTCGG GTACGGACGC TGCTCGCTTC ACGATCGACG CCGCGACCGG CGCGCTCAGT
TTCGTCAACG CGCCAGATTT CGAAAACCCC ACGGATGTGG GGGCCAACAA CGTCTACAAC
GTGGTCGTGA CCGCTTCCGA CGGCAGCCTG ACGGATACCC AGGCACTGGC AATCACCGTC
ACGAACAAGA AGGGCGTAAC CCTCAATGCT TCGTCGAGCA CCGGCAGCGT TCTGAACGGG
ACGGGCGAGG AAGACCAGCT CAATGGCTGG AAGGGTGCCG ATACCCTCTA CGGTCTCGGC
GGTAACGACC GTCTCGACGG TGCGGGCGGA AACGACCGCC TTTATGGCGG TGATGGCAAG
GACGTTCTGA TCGGCGGCGC CGGTACGGAT ATCATGTCTG GCGGGGCTGG TGCGGACCGC
TTCGAGTTCA ACGCGCTGGG GAACAGCGTG ACGGGTGCAT TGCACGACGT CATCACCGAC
TTCGAAGCGG GCATCGACTT GATCGATGTG TCGAGCATCG ACGCGAATTC CGGCAAAGGC
GGGAACCAGA CCTTTGTCCT GCTAGCGGAA GGTGCGGCGT TCACGGGGGT CGGGCAACTT
CGTTACTTCT ACGACAGTGC GACCGACCAG ACGATTGTCC AAGGTAACGT GAACAACAAT
CTGGCAGCCG ATTTCGAAAT CGCATTGTCC GGACATCAAA CCCTGTCCGC AAGCATGTTC
ATCCTCTGA
 
Protein sequence
MTYIQIDGSL SDWSSNLRID AGAVDGYQIY ATTDATDYIF AFAAPTAVGA NTTIWLNTDL 
NQATGYQLWG TVGAEFNVNF KSDGSAALYS GAAGGTLVAD NLVLAYNADK TMVELRVPKD
LLGNPGSIDT VYDINDTAII PSFYQDNAIR VWDDSELASV IPATDTRIAI VYSATTAANY
FSQTAYSDLF MAAQSQAAQA GVPFDIITEA DLTDINKLAQ YKAIVFPSFR NVQASQADEI
AHTLQLASQE FHVGFIVSGE FMTNDENGNA MAGNSYSRMA TLLDATRVTG GTATSLTVTA
TDPTGVVLDG YANGELVNQY ANVGWNAFQS VSGTGQTIAT ETINGSSTYA AVLATQTGGR
NVLFSSDAVM ADANMLQRAI DYAVSGETVT VSLNMTRDAG LVAARVDMDQ SMYIEDVNGG
IYDQLVPLLQ QWKAQYNFVG SFYVNIGDNT QQGIYTDWNK SLPYYTAMIG LGNEIGTHTY
THPEDTNLLS PSQLQFEFEL STQILEQKLS AALGYAYTIE GAAIPGAPET LTTSLAIEQY
VKTYLTGGYT GQGAGYPNAF GYLTPGSQDK VYIAPNTFFD FTLFDWLHLS AADASALWQS
QYEKIVSQAD SPVVVWPWHD YGATAFNSPN YAPEIFNTFL AQAAADGMEF VTLADLANRI
NAFHGAKVTT SVSGNTITAN VTASGNVGTF AFDLQGQGSQ VISSVAGWYA YDSDSVFLPQ
NGGTFVITLG AAQTDVTHII DLPMRATLMS VTGNGTNLSF QIQGEGTVVI DLSDPTNKSV
QVSGATIVSQ VGDKLTIDIG PVGSHTVTVT QTSLNHAPVI ESNGGGDTAA ISLAENLLAV
TAVIATDADA NALTYSITGG ADASKFTINA TTGALAFLAA PNFEVPTDVG GNNVYDVVVT
ASDGALTDSQ ALAVTVTNVN EAPVITSNGG GATASISLAE NNAAVTVVTS TDPENTARTY
SLSGTDAARF TIDAATGALS FVNAPDFENP TDVGANNVYN VVVTASDGSL TDTQALAITV
TNKKGVTLNA SSSTGSVLNG TGEEDQLNGW KGADTLYGLG GNDRLDGAGG NDRLYGGDGK
DVLIGGAGTD IMSGGAGADR FEFNALGNSV TGALHDVITD FEAGIDLIDV SSIDANSGKG
GNQTFVLLAE GAAFTGVGQL RYFYDSATDQ TIVQGNVNNN LAADFEIALS GHQTLSASMF
IL