Gene Saro_0530 details

Gene Information

Locus tag: Saro_0530
Symbol:
ID: 3918660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 573909
End bp: 575255
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 63%
IMG OID: 640443260
Product: hypothetical protein
Protein accession: YP_495811
Protein GI: 87198554
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2342] Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase
TIGRFAM ID: [TIGR01370] possible cysteinyl-tRNA synthetase, Methanococcus type
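
The start, end, gene length, and protein length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; both are assumptions that happen to match the values listed here, not something the record states. The function names are hypothetical and do not belong to any database API.

```python
# Hypothetical helpers for checking the coordinate fields of this record.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(573909, 575255)
print(length, protein_length(length))  # prints: 1347 448
```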


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCGAA CGGGAATGTA TGTACTTCAG GGTGTCGATC CGGCGGCAAT TGCCGAGGCG 
CCCTTCGACG TGAAGGTCGT CGACATCTAC AACGACAGCG GGGCGATGTT CACGCCCACC
CAGGTTGCCC AGATGGGTGG CGGGCCGGGT TCCGCACTCT TGCTCGGTTA CTTCAGCATC
GGCGAGGCGG AAGTCTACCG GGACTATTTC AGCAGCATTC CCAAGTCCGC GCTCGGGCCG
GAAAACCCGC AGTGGAAGGG CAACTACGAG GTTGCCTTCT GGACCCAGGA ATGGCGCACA
GTCGCCACCA CCTACCTCGA TAAGGTGATC GCCGCCGGCT ACGACGGGAT CTACTTCGAC
GTGGTGGACG AGTACCAGCA GGCCTGGGCC CAAGCCAACT GCCCGGGCGG TGCGGCCGGT
GCAGAGCAGG CGATGGCCGA CCTTGTAGCC TATCTGGCCG ACTATGCCCA TGCCAAGAAC
CCCGCGTTCA AGATCTGGGC AAACAACGCC GAGGAACTGC TGACCAACCA GACATATTTC
AGCCACCTCG ACGGCATGTT CAAGGAGAAC CTGTTCTACA CCGATTCGGG CTCGGCTCAG
CCCAAGTCCG AGACCAGCTA CAGCCTGCAA ATGATGCAAA GCATGCTGGC GGCGGGCAAG
GACGTGATCG CGATCGAATA TGTCTCAGAC CCGGCCAAGG TCACGGACGT CGAGGCCAAG
GCCGCGGCGG CGGGCGTGGG CTACTACACC GCCGACATCG ACCTCAACGG GATCAGCTAT
ACCGGCGTGC TGCCGGGGCA GGTGATCCAC GAGGACTGGA GCGGCTTGAC CCCGACGACC
ACGACCACAA CGACGACGAC CGCGCCCCCG CCGGCCGACC TCGTGTTGAA CGGCACGAGC
TATGCCGATA CGCTGACCGG GCAAGGCGGC AACGACAAGC TCTTCGGTTT TGCGGGCAAG
GACGTGCTGG CCGGCAACGG CGGCAATGAC TGGCTCGAGG GTGGTAATGG CAACGACCGG
CTCACCGGTG GTGCCGGAGC GGATTCGTTC GTGTTCCGGG CCAGCGGATC CAAGCATATG
GATACGATCA CGGATTTCCA GCCGGGCATC GACCATATCG TGCTCGACCG CGCCGTGTTC
ACGAAGGTCG GCGCGGCGGG TGCGCTGGCA GATGCCGCTT TCTGGGAAGG CTCGAAGGCG
CACGATGCGA GCGACCGGAT CACCTACGAT TCGGTGACCG GCACGATTAG CTACGACAGC
GACGGGACCG GCTGGCACAG CGCGGTTGTC GTCGCGACGT TGGCGCCTGG TCTCCACCTG
ACCGCCAGCG ATTTCATTGT AATCTGA
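
A GC content figure like the one reported for this gene can be recomputed from the sequence above. The sketch below is a minimal illustration, assuming the sequence is pasted as plain text with the 10-base spacing used here. The function name is hypothetical, and how the database rounds its percentage is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so a sequence
    pasted in 10-base blocks can be passed in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, used only as a small example.
print(gc_content("ATGGCGCGAA"))  # prints: 60.0
```

Passing the full gene sequence instead of the short fragment gives the value for the whole gene.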
 
Protein sequence
MARTGMYVLQ GVDPAAIAEA PFDVKVVDIY NDSGAMFTPT QVAQMGGGPG SALLLGYFSI 
GEAEVYRDYF SSIPKSALGP ENPQWKGNYE VAFWTQEWRT VATTYLDKVI AAGYDGIYFD
VVDEYQQAWA QANCPGGAAG AEQAMADLVA YLADYAHAKN PAFKIWANNA EELLTNQTYF
SHLDGMFKEN LFYTDSGSAQ PKSETSYSLQ MMQSMLAAGK DVIAIEYVSD PAKVTDVEAK
AAAAGVGYYT ADIDLNGISY TGVLPGQVIH EDWSGLTPTT TTTTTTTAPP PADLVLNGTS
YADTLTGQGG NDKLFGFAGK DVLAGNGGND WLEGGNGNDR LTGGAGADSF VFRASGSKHM
DTITDFQPGI DHIVLDRAVF TKVGAAGALA DAAFWEGSKA HDASDRITYD SVTGTISYDS
DGTGWHSAVV VATLAPGLHL TASDFIVI
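
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with the standard codon assignments, which translation table 11 shares with the standard code. It is a simplification: table 11 also permits alternative start codons, which this sketch does not handle, but this gene begins with ATG so the distinction does not arise here. All names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.upper().split())  # drop the 10-base block spacing
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCGCGAA CGGGAATGTA TGTACTTCAG"))  # prints: MARTGMYVLQ
```

The output matches the first 10 residues of the protein sequence listed above.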