Gene Saro_2351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2351 
Symbol 
ID3915696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2498081 
End bp2499760 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content63% 
IMG OID640445107 
Productputative ABC transporter ATP-binding protein 
Protein accessionYP_497622 
Protein GI87200365 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.701309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCCC AATACGCCTA CGTCATGAAG AACATGACGA AGACCTTCCC CGGCGCGCAG 
AAGCCGGTGC TCAACAACAT CAGCCTGCAG TTCTACCAGG GCGCCAAGAT CGGCATTGTC
GGCCCGAACG GCGCCGGCAA GTCGACCCTG ATGAAGATCA TGGGCGGGAT CGACACCGAC
TTCACGGGCG AGGCCTGGCC GGGCGAGAAC ATCACCGTCG GCTACCTGCC GCAGGAACCG
CAGCTCGACA CGAGCAAGAC CGTGCTCGAG AACGTCAAGG ACGGCGCGCG CGAGACGGCA
GACCTCGTCG ACCGGTTCAA CGAGATCTCC AACATCATGG CCGATCCGCC GGAGGACGTC
GATTTCGACG CGCTGATGGA AGAGATGGGC GAATTGCAGG GCAAGATCGA CGCTGTCGAC
GGCTGGACGC TCGACAACCA GCTCGAGATC GCGATGGAAG CGCTGCGCTG CCCTCCCGGC
GACTGGCCGG TGGACAGCCT TTCGGGCGGT GAAAAGCGCC GCATCGCGCT GACCCGCCTG
CTGATCCAGA AGCCGTCGAT CCTGCTGCTC GACGAACCGA CCAACCACCT CGACGCCGAA
AGCGTCGAAT GGCTTGAGAA CCACCTCAAG GAATATGCCG GCGCGGTTCT GATGATCACC
CACGACCGCT ACTTCCTCGA CAACGTGGTG GGCTGGATCC TCGAACTCGA CCGCGGAAAG
TACTTCCCGT ACGAGGGCAA CTACTCGACC TACCTCGAGA CCAAGGCCAA GCGCCTGGCG
CAGGAAGAGC GCGAGGAAAG CGGCAAGCAG AAGGCGCTCG CCCGCGAACT CGAGTGGATC
CGGCAGACCC CGGCCGCGCG CCAGACCAAG TCCAAGGCGC GTATCCGCAA GTTCGAGGAA
CTGCAGAACG CACAGGACAA CCGCGCCGTC GGCAAGGCCC AGATCGTCAT CCAGGTGCCC
GAGCGCCTGG GCGGCAAGGT CATCGAGGCG AAGAACATCT CGAAGGCCTA TGGCGACAAG
CTGCTGTTCG AGGACCTTTC GTTCATCCTG CCGCCGGGCG GCATCGTCGG CGTCATCGGT
CCGAACGGCG CGGGCAAGTC CACGCTGTTC AAGATCATCA CCGGCAAGGA GCAGCCCGAC
AGCGGCACGA TCGAGATCGG CTCGACCGTG CATCTGGGCT ATGTCGACCA GAGCCGCGAC
CATCTCGACC CGAAGAAGAA CGTCTGGGAG GAAATCTCGG ACGGCCTCGA CTACATGAAG
GTCAACGGCC AGGACATGTC GACGCGCGCC TATGTCGGTG CGTTCAACTT CAAGGGCCAG
GACCAGCAGA AGAACGTCGG CAAGCTTTCA GGCGGTGAGC GCAATCGCGT CCACATGGCC
AAGATGCTCA AGGAGGGTGG CAACGTCCTC CTGCTCGACG AACCGACCAA CGACCTCGAC
GTCGAAACGC TGGCCGCACT GGAAGACGCG ATCGAAAACT TCGCCGGTTG CGCCGTGGTC
ATCTCGCACG ACCGCTTCTT CCTCGACCGT CTGGCGACGC ATATCCTCGC GTTCGAGGGC
AACAGCCACG TCGAATGGTT CGAAGGCAAC TTCGCCGCCT ATGAAGAAGA CAAGCGCCGC
CGCCTTGGCG ATGCGGCCGA CCGGCCGACG CGCCTGGCGT ACAAGAAGCT GACGCGCTGA
 
Protein sequence
MAAQYAYVMK NMTKTFPGAQ KPVLNNISLQ FYQGAKIGIV GPNGAGKSTL MKIMGGIDTD 
FTGEAWPGEN ITVGYLPQEP QLDTSKTVLE NVKDGARETA DLVDRFNEIS NIMADPPEDV
DFDALMEEMG ELQGKIDAVD GWTLDNQLEI AMEALRCPPG DWPVDSLSGG EKRRIALTRL
LIQKPSILLL DEPTNHLDAE SVEWLENHLK EYAGAVLMIT HDRYFLDNVV GWILELDRGK
YFPYEGNYST YLETKAKRLA QEEREESGKQ KALARELEWI RQTPAARQTK SKARIRKFEE
LQNAQDNRAV GKAQIVIQVP ERLGGKVIEA KNISKAYGDK LLFEDLSFIL PPGGIVGVIG
PNGAGKSTLF KIITGKEQPD SGTIEIGSTV HLGYVDQSRD HLDPKKNVWE EISDGLDYMK
VNGQDMSTRA YVGAFNFKGQ DQQKNVGKLS GGERNRVHMA KMLKEGGNVL LLDEPTNDLD
VETLAALEDA IENFAGCAVV ISHDRFFLDR LATHILAFEG NSHVEWFEGN FAAYEEDKRR
RLGDAADRPT RLAYKKLTR