Gene Saro_2199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2199 
Symbol 
ID3918865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2339884 
End bp2340771 
Gene Length888 bp 
Protein Length295 aa 
Translation table11 
GC content65% 
IMG OID640444954 
ProductSMF protein 
Protein accessionYP_497471 
Protein GI87200214 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 



Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000109566 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCTCG CCCTAAACGA TTCAGGGCCA GCGCTGGCGC CAATCTCGCC GCGCCGAGAG 
CTCGGCGCCT ATGAGGCCCT GTGGCTCGAA AAGGGGGCGA CCTTCAAAAC CCTGGCCGAT
CGCTTTGCGC TCGATGCCGA AGCGCTTCCG TCCGACTTCG TGCCCGCCCA GCTTGCCGAG
CAGTGCGCGG CCGAGGTGAT GGCAAAGCTC AAGAAGGCCG GTGTCCATCA GTTTGGCGTC
CGCATCCATC ATGCCGGTGA CTATCCCGCA AAGCTGCGCG ACGCGCGCCA CCCAGTCGAG
CTTCTCTATT ATCGCGGCGC CTGGGAGATC ACCGAAACCC GGTGCGTGGC CGTCGTCGGA
AGCCGCGAGG CCTCGCCCGA CGGTATCCGT CGCGCCGAGC GGCTTGCGCG CGAACTCGTC
GATCGCGATT TCACGGTCGT CTCTGGCCTT GCCAAGGGCG TCGATTCGGC TGCCCATCGC
GGCGCGATCG CGCGCGGTGG ACGCACCATT TCCGTGATCG GGACGCCGCT TGGATCCTGC
TACCCCAAGG AGAATGCCGA TCTGCAAGAG GAGATCGCCC GCGATCATCT GCTGATCTCG
CAGGTGCCGG TTCTTCGCTA CGCCAAGCAA GCACCCCAGC ATAACCGCCT TTTCTTCCCC
GAGCGCAATG TCACGATGAG CGCTCTCACC GAGGGCACGA TCATCGTCGA GGCTGGCGAT
ACGTCGGGCA CGCTGACCCA GGCGCGCGCC GCGCTCCATC AGGGCCGCAA GCTCTTCATT
CTCGACAATT GCTTTCAGCG GACGGACATC ACGTGGCCAG CCCGCTTCGA AGCCGAAGGT
GCAGTGCGCG TGAAGACGCC CGACGACATC TGGAGCGCCC TTGGTTGA
 
Protein sequence
MRLALNDSGP ALAPISPRRE LGAYEALWLE KGATFKTLAD RFALDAEALP SDFVPAQLAE 
QCAAEVMAKL KKAGVHQFGV RIHHAGDYPA KLRDARHPVE LLYYRGAWEI TETRCVAVVG
SREASPDGIR RAERLARELV DRDFTVVSGL AKGVDSAAHR GAIARGGRTI SVIGTPLGSC
YPKENADLQE EIARDHLLIS QVPVLRYAKQ APQHNRLFFP ERNVTMSALT EGTIIVEAGD
TSGTLTQARA ALHQGRKLFI LDNCFQRTDI TWPARFEAEG AVRVKTPDDI WSALG