Gene Saro_1891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1891 
Symbol 
ID3917112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2001334 
End bp2002968 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content64% 
IMG OID640444635 
Productalpha amylase, catalytic region 
Protein accessionYP_497165 
Protein GI87199908 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.687574 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAGA CCGAAACAGC CATTGCCGAC CCCAAAATGC ACACGGAGAC GCCGTGGTGG 
CGCGGTGCGG CGATCTATCA GATCTATCCA CGCAGCTTTT GCGATTCCAA CGGCGACGGC
ATCGGGGACC TGAATGGCAT CGCCTCGCGA CTGGATCACG TCGCGCGCCT TGGCGTAGAC
GCGATCTGGA TCTCGCCGTT CTTCACCTCG CCGATGAAGG ACTTCGGCTA CGACGTCGCC
GACTACTGCG ACGTCGATCC GATCTTCGGG ACGCTGGCGG ATTTCGATGC CCTGGTAAAA
CGCGCGCACG AGCTGGGCCT CAAGGTCACG ATCGACCAGG TCTATGCGCA TACCTCGGAC
ATTCATCCGT GGTTTGCCGA AAGCCGGCAG GACCGTACCA ACGCCAGGGC GGACTGGTAT
GTCTGGGCCG ATCCTAAGCC CGATGGGTCG CCCCCGTCGA ACTGGCAGTC GGTATTCGGC
GGTCCGGCAT GGACGTGGGA TGCGCGGCGC TGCCAGTATT ACCTGCACAA CTTCCTATCC
AGCCAGCCCC AGGTAAACGC CCACAATCCC GAAGTGCAGG AAGCGCTGCT CGGGGCGATG
AAGTTCTGGC TGGACCGGGG TGTTGACGGC TTCCGCCTCG ATGCCCTGAA CTTCCTGATG
CATGACCCGA CCTTGCGGGA CAATCCGCCG GCGCCGGACG ATGGCCGACG CAAGACGCGG
CCATTCGACT TCCAGCTCAA GATCTACAAC CAGTCGCACC CCGACATCCT GAAGTTCATC
CAGCGGGTGC GGAACCTGTG CGACGACTAT GGCGCGGTGT TCACAGTCGC GGAAGTCGGC
GGAGACCTTG CCGAGACGGA GATGAAGGCG TTCACTGCGG GGGACAGGCA TCTCAACAGC
GCCTACGGCT TCGACTTCCT CTACGCGGAC AGGCTGACGC CGCATTTCGT GGAAAAGGCC
GTGGCCAAAT GGCCCGATGC ACCCGGCATG GGCTGGCCGA GCTGGGCCTT CGAGAACCAC
GATGCGCCGC GCGCGCTTTC GCGCTGGTGC GCGCCGGACC AGCGCGAGCC GTTCGCCCGG
CTGAAGGCCA TGCTCTTCGC TTCGTTGCGG GGGAACATCA TCGTCTACCA GGGCGAGGAA
CTGGGACTGA CGCAGGTCGA CATTCCGTTC GAGCAGTTGC AGGACCCCGA GGCCATCGCC
AATTGGCCGC TGACGCTTTC ACGCGACGGT GCACGCACGC CGATGCCCTG GTTAGTACAA
TCGGGCGAGG GCGGGTTCAC GTCGGGCGCG CCCTGGTTGC CGCTGGGCGA GGAAAACCTG
TCGCGGGCCG TCGACCGGCA GGAAGGCGAT CCGGCATCGC TCTTGAACCT GACCACGCGC
CTGCTTCGCC TGCGCCGCGA GACGCCCGCG TTACGGATCG GTTCCTTCGA GGTGATCCAT
GCAGACGAAT GCCTTCTTGC CATTCGGCGC GTTTTGGGTG AGCAATCGAT TGCAGGGCTG
TTCAACCTGT CCTCCGTCCC GGTGGTCTGG CCCCACGGGC TGGTGCGGGA GGGCAAGGAA
ATGGCGTCGG TCAACGGCGC GACAGTGGGG CAGTTGCCGC CGTTCGGCGC GCTCCTGATC
GAAGAGAGGA TCTGA
 
Protein sequence
MKQTETAIAD PKMHTETPWW RGAAIYQIYP RSFCDSNGDG IGDLNGIASR LDHVARLGVD 
AIWISPFFTS PMKDFGYDVA DYCDVDPIFG TLADFDALVK RAHELGLKVT IDQVYAHTSD
IHPWFAESRQ DRTNARADWY VWADPKPDGS PPSNWQSVFG GPAWTWDARR CQYYLHNFLS
SQPQVNAHNP EVQEALLGAM KFWLDRGVDG FRLDALNFLM HDPTLRDNPP APDDGRRKTR
PFDFQLKIYN QSHPDILKFI QRVRNLCDDY GAVFTVAEVG GDLAETEMKA FTAGDRHLNS
AYGFDFLYAD RLTPHFVEKA VAKWPDAPGM GWPSWAFENH DAPRALSRWC APDQREPFAR
LKAMLFASLR GNIIVYQGEE LGLTQVDIPF EQLQDPEAIA NWPLTLSRDG ARTPMPWLVQ
SGEGGFTSGA PWLPLGEENL SRAVDRQEGD PASLLNLTTR LLRLRRETPA LRIGSFEVIH
ADECLLAIRR VLGEQSIAGL FNLSSVPVVW PHGLVREGKE MASVNGATVG QLPPFGALLI
EERI