Gene Saro_2942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2942 
Symbol 
ID3917377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3158785 
End bp3160533 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content67% 
IMG OID640445720 
Productoligopeptide/dipeptide ABC transporter, ATP-binding protein-like 
Protein accessionYP_498211 
Protein GI87200954 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCGC ATGTCGAAGT CCGCAATCTG CGCATCGGCG TTGGCGACAA GGCCATCGTC 
GACGGCGTCT CGTTCGAGAT CCCGCGGGGC GAAGTTCTTG CGCTGATTGG CGAATCCGGC
TCCGGCAAGA CTACGATCGC CCTTTCGCTG ATGGGTCATG CCCGTTTTGG TGCGAAGATC
GAGGGCGAGA TCCGCCTTGG CGATACCCGC ATCGACCAAC TCGATGAAGC GGGTCTCCAG
GCCTTGCGTG GGCGGCGCGT CGCTTACGTC GCGCAAAGCG CCGCGTCCGC GTTCAACCCC
TCGCTCACGA TCATGACACA AGTGACCGAG CCCTTGCTCG TGCACGGCCT TGCGACGAGG
GCGGAAGCTG AGGCCAAGGC CGTGGCCTTG TTCAAGGCGC TTGCGCTGCC GCATGCCGAA
ACCATCGGTG CTCGCTATCC GCACCAGCTT TCGGGCGGCC AGCTCCAGCG GCTGATGGCG
GCTATGGCGC TGATCACCGG CCCGGAACTC GTCATCCTCG ACGAACCCAC CACAGCCCTT
GACGTGACCA CGCAGGTCGA AGTCCTGAGG GCATTCAAGG CCGCAATCGG CGCGGTCGGG
GCAACCGCGA TCTACGTCAG TCACGACCTT GCCGTCGTCG CGCAGATGGC CGACCGGATT
CTCGTGCTGA ACCAGGGGCA GACGCGCGAA CAGGGCGCGG CGGAGCAGAT CCTCCACGCC
CCGCAGGACG ACTATACCCG CACGCTGATG GCGGCCGCGC GTCCCCACGC TCGCACCGCC
CCCGCGAGGG TTGCCGATGT GCCGTTGCTG CGCGTCGAAG GCGTCCACGC CGCCTATGGC
AAGGTGCCTG TGCTGCGCGA CATCTCGCTC AACCTGGCGA AAGGCGCGAC GCTTGGCGTG
ATCGGCGAAA GCGGGTCGGG AAAGTCGACC CTTGCCCGCG TGATCGCCGG TCTCCTGCCG
CGCAGTGCCG GTTCGGTCAG CGTCGATGGC GAAGAGCTGC CGCGCGGTCT CGACGGCCGC
AGCCGCGAGC AGTTCCGGCG CGTGCAGCTT GTCTTCCAGA ATGCCGACAC GGCGCTGAAC
CCGGTCCATA CCGTCGGCCA GACGCTGGCG CGGCCGCTGG CGTTCTATCA TGGTCTCACT
GGCGCCGAAG GCAGGGCGGA AGTTGCGCGT CTGCTCGATC TGATTCGCCT GCCCGCCGCT
TTCGCCGACC GCAATGTCCG CCAGCTCTCC GGCGGCCAGA AGCAGCGCGT CAATCTTGCC
CGGGCTCTCG CCGCGCGGCC CGACGTGCTG TTGTGCGACG AGGTCACGTC CGCGCTGGAT
ACCGTCGTGG GCGCGGCGAT CCTCGAACTG ATCGACGAAC TGCGCCGGGA TCTCGGCATC
GCCACCGTGT TCATCAGCCA CGACATTTCG ACCGTCCGTG CCTTTTGCGA CAAGGTGCTG
GTGCTTTATG GCGGCACGGC CGTGGAGCAG GCCGATGCCG CAGCTTTCGC GCGCGGCCCG
CACCATCCCT ACACGACGCT GCTCATGGAT TCGGTGCCCG AGATGCGTGC CGGCTGGCTG
GAGCAGGCCG GAGCCCGCCC CGCGGCGCTG GCAGCATCTG ACCTCGACGG GCTCTGCCGC
TTCCTGGGAC GATGTCCCGT CGCCATTTCC GGCGCGTGCG ACCGCCAGGC CCCGCCCGCG
CGGACAGGAG ACGGTCTTGC GCTGCTTTGC CATCACGACT TTGAACGACT GGGGGAATTG
ACCGCATGA
 
Protein sequence
MSAHVEVRNL RIGVGDKAIV DGVSFEIPRG EVLALIGESG SGKTTIALSL MGHARFGAKI 
EGEIRLGDTR IDQLDEAGLQ ALRGRRVAYV AQSAASAFNP SLTIMTQVTE PLLVHGLATR
AEAEAKAVAL FKALALPHAE TIGARYPHQL SGGQLQRLMA AMALITGPEL VILDEPTTAL
DVTTQVEVLR AFKAAIGAVG ATAIYVSHDL AVVAQMADRI LVLNQGQTRE QGAAEQILHA
PQDDYTRTLM AAARPHARTA PARVADVPLL RVEGVHAAYG KVPVLRDISL NLAKGATLGV
IGESGSGKST LARVIAGLLP RSAGSVSVDG EELPRGLDGR SREQFRRVQL VFQNADTALN
PVHTVGQTLA RPLAFYHGLT GAEGRAEVAR LLDLIRLPAA FADRNVRQLS GGQKQRVNLA
RALAARPDVL LCDEVTSALD TVVGAAILEL IDELRRDLGI ATVFISHDIS TVRAFCDKVL
VLYGGTAVEQ ADAAAFARGP HHPYTTLLMD SVPEMRAGWL EQAGARPAAL AASDLDGLCR
FLGRCPVAIS GACDRQAPPA RTGDGLALLC HHDFERLGEL TA