Gene Saro_2085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2085 
Symbol 
ID3917733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2221892 
End bp2222962 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content69% 
IMG OID640444838 
Productextracellular ligand-binding receptor 
Protein accessionYP_497358 
Protein GI87200101 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.137371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATTCCCA AGGGTCCGGC CGGCCCTGCT CCGGAACCGA CCGAGAAGGC CCCGGACGCC 
AATGTCCTGC CGACCGACGC GGGCCGTCAC CGCGTGGCGC TGCTCGTGCC GCTGACCGGG
GCCAACGCCG CTGTCGGCCA GTCCATCGCC AATGCCGCGA CGATGGCCCT GCTCGATACC
AACGCCGCCA ACATGCGCAT CACCACCTAC GACACGGCCA GCGGCGCCGG CTCGGCCGCG
AGCCGGGCCA TTTCGGACGG CAACAAGCTG ATCCTCGGCC CGCTTCTCGG TAACGACGTC
GTACTCGTCA GCAATGTCGC CCGGCCCGCC AAGGTGCCGA TGATCACCTA TTCCAACGAC
AGCGCGGTCG CCTCGCGCGA TGTCTTCGTC ATGGGCCAGG CCCCCGGCCA GTCGGTCGCC
CGCGTACTCG GCTTCGCCAA GTCGAAAGGC ATCGGTTCGG TCGCCGCGAT CATCCCGACG
GGCGACTACG GCCAGCGCGC GATGAACGCG GTCGTCGATT CCGGCCGCGC CCTCGGCGTC
ACGGTCACCG CCATCGAGAC CTACGACCGC GGCAATACCT CGGTCGCCAG CGCCGTGCGC
CGGGTCAAGG AGAAGGGCCG CTTCGACGCC CTGCTCATTG CCGACGGCAG CCGCATAGCG
CTTCAGGCCG CCCCGCTCGC CGGCAAGGGC GTCAAGCTGC TCGGTACCGA ACTGTGGAGC
GGCGAAGCGG CGATCGCCAA GAGCCCCGCC ACGCGCGGCT CATGGTTCGC CGCCGTTTCG
GACGGGCGCT TCGGCCAGTT CGAGAAGTCC TACCGCACCC GCTTCGGCGC CACCCCCTCG
CGGCTGGCAA CGCTCGGCTA CGACAGTGTC CTGCTGACGC TGAACGTCGC GCGCAACTGG
AAGCCGGGAA CGACCTTCCC CACCGCCAAA CTCTACGATC CGCAGGGCTT CATCGGCCTC
GACGGCGTCT TCCGCTTCAC CGCCTCGGGC ATGGCAGAGC GGGCGATGGA AGTGCGCGAA
GTCGGCGCGG GCACCTTCAC CACCGTTTCC CCCGCTCCTG CAAAGTTCTG A
 
Protein sequence
MIPKGPAGPA PEPTEKAPDA NVLPTDAGRH RVALLVPLTG ANAAVGQSIA NAATMALLDT 
NAANMRITTY DTASGAGSAA SRAISDGNKL ILGPLLGNDV VLVSNVARPA KVPMITYSND
SAVASRDVFV MGQAPGQSVA RVLGFAKSKG IGSVAAIIPT GDYGQRAMNA VVDSGRALGV
TVTAIETYDR GNTSVASAVR RVKEKGRFDA LLIADGSRIA LQAAPLAGKG VKLLGTELWS
GEAAIAKSPA TRGSWFAAVS DGRFGQFEKS YRTRFGATPS RLATLGYDSV LLTLNVARNW
KPGTTFPTAK LYDPQGFIGL DGVFRFTASG MAERAMEVRE VGAGTFTTVS PAPAKF