Gene Saro_2068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2068 
Symbol 
ID3917715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2207602 
End bp2209398 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content67% 
IMG OID640444820 
ProductABC transporter related 
Protein accessionYP_497341 
Protein GI87200084 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCAG CACCGATCCT CAGCTGGGAA GGCCTTGGAC TTCTCCAGGG CAATGGCTGG 
CTTTTTCGCG ACCTCGACAT CCACATCGGC CCGCGCGACC GCCTGGCGCT GATCGGGCGG
AACGGCGCGG GCAAGACCAC GCTGCTCAAG CTGCTGGGCG GACAGATCGA TGCGGACAAG
GGTACCCGCT CGATCCAGCC CGGCACCAGG ATCGTGACGC TGGAGCAGGA CCCCTTCTTC
ACCGGCTATG ACACGCTGAT GGACTTCGCG CTGTCGGGCA AGGACGCGCC GGCCCGACAC
GAGGTCGAAT CGATTGCCGG GCAGCTCGGC ATCGACATGA GCCGCAAGGC GGACAGCGCC
AGCGGTGGCG AGCGGCGTCG GGCGGCCCTC GCCCGCGCAC TGGCAAGCGA GCCGGACCTG
CTCCTGCTCG ACGAGCCGAC CAACCACCTC GACCTTGCCG CCATCGACTG GCTGGAGGAC
TGGCTCCAGC GGTTCAAGGG TGCGTTCGTG GTGATCAGCC ACGACCGCAC CTTCCTCGAA
CGCCTGACCA GGGCGACGCT CTGGCTCGAC CGTGGATCGT TGCGCCGCAA GGACATTGGC
TTTGGCGGGT ACGAGGCCTG GATGGAACAG GTCTATGCCG AGGAAGCCCG CGCCGCCGAC
AAGCTCGACG CCAAGCTGAA GATCGAAGCC CACTGGCTGG AACGCGGCGT CACCGCGCGG
CGCAAGCGCA ACATGGGCCG CCTCGAAAAG CTTTATGAAA TGCGCGCGCA GCGGGCGGCG
ATGCTCTCGC CGCAGGGCAC CGCAAAGCTC GCCATCGCCA GCGACGATGC CAAGAGCAAG
GCGGTGATCG TCGCCGACCA CGTCAACAAG TCCTTCGGCG ATCGCCCGAT CGTCAAGGAC
TTCACCCTGC GCATCACGCG CAAGGACCGC ATCGGCGTCG TCGGATCGAA TGGCGCGGGC
AAGACCACGC TGCTCAAGCT CCTGACCGGC GAACTCGCGC CCGACAGCGG CACCGTGACG
CTGGCCAAGA CCCTCCAGGG CGTGATGATC GACCAGCAGC GCAGCCTGAT GGCGCCGGAA
AAGCGCGTGC GCGACGTGCT GGCCGATGGC AGCGACTGGA TCGACGTGCG CGGGGTCCGC
AAGCACATCC AGGGCTATCT CAAGGACTTC CTGTTCGATC CCGGCCTTGT CGAGGCGCGC
GTCGGCACGC TTTCGGGCGG CGAGCGGTCG CGCCTCCTGC TGGCACGCGA ATTCGCGCGC
AAGTCCAACC TGCTGGTGCT GGACGAGCCG ACCAACGACC TCGACCTGGA AACGCTGGAT
CTGCTCCAGG AAGTGATAGC GGACTATGAC GGCACGGTGC TGATCGTCAG CCACGACCGC
GACTTCCTCG ACCGCACGGT CACGATCACG CTGGGCATGG ACGGTTCGGG CCGGGTCGAT
ATCGTCGCTG GCGGCTATGC CGACTGGGAA AAGATGCGCA AGAGCAGAGG CGCGGGCGCT
GCAAAGGCGG CATCGCCCCG GGAAGCCGGA GCCCCTCCAC CGCCTCCACC GCCGCCGCCG
GCGAAGAAGG GCAAGCTTTC CTACAAGGAC CAGCGCGACT ACGAACTTCT GCCGACGCGC
ATCGAGGAAC TCGAGGCAGC AATCGCGCGT GGCGAAGCCC AGTTGGCCGA CCCGGACCTC
TACGCCAGGG ACCCGAAGAA GTTCGACGCG CTGATGGCGG CGCTGGAAAA GGTGCGGGGC
GAGAAGGAAG CAGCCGAGGA GCGCTGGCTG GAACTGGCCG AAATGGTCGA GGGCTGA
 
Protein sequence
MAAAPILSWE GLGLLQGNGW LFRDLDIHIG PRDRLALIGR NGAGKTTLLK LLGGQIDADK 
GTRSIQPGTR IVTLEQDPFF TGYDTLMDFA LSGKDAPARH EVESIAGQLG IDMSRKADSA
SGGERRRAAL ARALASEPDL LLLDEPTNHL DLAAIDWLED WLQRFKGAFV VISHDRTFLE
RLTRATLWLD RGSLRRKDIG FGGYEAWMEQ VYAEEARAAD KLDAKLKIEA HWLERGVTAR
RKRNMGRLEK LYEMRAQRAA MLSPQGTAKL AIASDDAKSK AVIVADHVNK SFGDRPIVKD
FTLRITRKDR IGVVGSNGAG KTTLLKLLTG ELAPDSGTVT LAKTLQGVMI DQQRSLMAPE
KRVRDVLADG SDWIDVRGVR KHIQGYLKDF LFDPGLVEAR VGTLSGGERS RLLLAREFAR
KSNLLVLDEP TNDLDLETLD LLQEVIADYD GTVLIVSHDR DFLDRTVTIT LGMDGSGRVD
IVAGGYADWE KMRKSRGAGA AKAASPREAG APPPPPPPPP AKKGKLSYKD QRDYELLPTR
IEELEAAIAR GEAQLADPDL YARDPKKFDA LMAALEKVRG EKEAAEERWL ELAEMVEG