Gene Saro_2500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2500 
Symbol 
ID3916821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2700925 
End bp2702313 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content61% 
IMG OID640445257 
Producthypothetical protein 
Protein accessionYP_497770 
Protein GI87200513 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2342] Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase 
TIGRFAM ID[TIGR01370] possible cysteinyl-tRNA synthetase, Methanococcus type 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.967827 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGGA CGGGAATGTA TGTTCTGCAG GGAATTGTTC CTTCGGAAAT TGCTGTAGCT 
CCGTTTGATG TCAAGGTAAT AGACATCTAC AACGATGACG GCGTAGCTTT CACGCCCACG
CAGGTTGCGC AGATGGGCGG CGGACCGGGC AGTGCATTAC TTCTCGGATA CTTCAGCATT
GGTGAAGCAG AGGTCTATCG AGACTACTTC AACACCATTC CGAAATCGGC CCTCGGACCT
GAAAATCCAC AGTGGGCCGG TAACTATCAG GTCGCCTACT GGACGGCCGA ATGGCGCACC
GTGGCAACCG CCTACATCGA CCGGATCATC GCTGCCGGCT ATGATGGCGT CTACTTCGAC
GTCGTCGACG AATACCAGCA GAAGTGGGCG CAGACCTATT GCCCGGGCGG CGCGGCAGGT
GCCGAACAAG CCATGGCGGA TCTGGTCGCC TACCTCGCCG ACTATGCCCA TGCCAAGAAC
CCGGCGTTCA AGATCTGGGC AAACAACGCC GAGGAACTGC TGACCAACCA GACATATTTC
AGCCACCTCG ACGGCATGTT CAAGGAGAAC CTGTTCTATA CGGACAGCGG TTCGAAGCAG
CCTTCGAGCG AGACGCAGTA CAGCATGAGC CTCCTCCAGA TGATGCTGGC GGCGGGCAAG
GATGTCATCG CCATCGAGTA CGTTTCGGAC TCGGCAAAGA TCGCCGACGT GGAAACGCAG
GCCGCGCACT ACAACGTCGG CTACTACACC GCGGACATCA ATCTCGACGG CATCAGCTAT
ACCGGCGTGC TGCCCGGCCA GTACATCCAC GAAGACTGGA GCGGTCTGAC GACGACCACG
ACGACGACCA GCACCAGCAC CACCTCTACG TCGACAACCA CCTCCACCAC GACAACGTCG
ACCACCACGC TGGTGACCGA TCTCACCCTG ACCGGGACGA GCAGTTCCGA CAACCTGGCT
GGCAAGTCCG GCAACGACAA GCTCTACGGC AAAGCCGGTG CCGACGTGCT GTCAGGCAAC
GGCGGCAACG ACTGGCTCGA AGGCGGCAAC GGCAACGACA AGCTTGCCGG CGGCGCGGGA
GCGGACTCCT TCGTGTTCCG CGCCTATGGC AACAAGCACA GGGACGCGAT CGCCGACTAC
CAGGCCGGCA CGGACCATAT CGTGCTCGAC CACGCGGTGT TCACTGCGGC GGGCGCCATC
GGCACCTTGA GCGATGCCGG ATTCTGGATC GGATCGGCAG CCCACGATGC CAGCGACCGG
ATCATCTACA ACGCATCCAC TGGATTGATC AGCTACGATG CGGACGGCAC CGGCAGACTC
TCGGCCGTCG CCATCGCCAC CGTCGCGCCC GGCACGCTGC TGACGCATGC CGACTTCCTG
ATCATCTGA
 
Protein sequence
MARTGMYVLQ GIVPSEIAVA PFDVKVIDIY NDDGVAFTPT QVAQMGGGPG SALLLGYFSI 
GEAEVYRDYF NTIPKSALGP ENPQWAGNYQ VAYWTAEWRT VATAYIDRII AAGYDGVYFD
VVDEYQQKWA QTYCPGGAAG AEQAMADLVA YLADYAHAKN PAFKIWANNA EELLTNQTYF
SHLDGMFKEN LFYTDSGSKQ PSSETQYSMS LLQMMLAAGK DVIAIEYVSD SAKIADVETQ
AAHYNVGYYT ADINLDGISY TGVLPGQYIH EDWSGLTTTT TTTSTSTTST STTTSTTTTS
TTTLVTDLTL TGTSSSDNLA GKSGNDKLYG KAGADVLSGN GGNDWLEGGN GNDKLAGGAG
ADSFVFRAYG NKHRDAIADY QAGTDHIVLD HAVFTAAGAI GTLSDAGFWI GSAAHDASDR
IIYNASTGLI SYDADGTGRL SAVAIATVAP GTLLTHADFL II