Gene Saro_2201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2201 
Symbol 
ID3918867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2341420 
End bp2342457 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content66% 
IMG OID640444956 
ProductSMF protein 
Protein accessionYP_497473 
Protein GI87200216 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 



Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000301868 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCGAC CAAGCCACGA CGGACCGCCA GACCGTATCA GCGACGTCCA TGCGGCGACC 
GCGCTCACTC CGCCGAGCGC GGGCACGATC GACGAGGAGC GTGCCTTTCT CGCTCTCGCG
ACGGTGCGGG GCATTGGTCA GAAAACCCTG TTCGCCATGG CCGACGACAA AAGATCGTTC
AGCGATGCGC TCGAATATGG GCCAGAGGCG TCCAACCCTG CAGGCAGCGA CGGCAAGGCC
ATCTCAGAGC GGCACTGGTC GCGGGTTCGC GGGCATGCGC TTGAACAGGG TGACCGCCTG
GCCGAGCATC TCGAAGCGCT CGGCATCGGT CTGCTCTTTC GGGGGTCGCC CGGCTTTCCC
TCCGCCTTGC TCGACCTTGA ACGTCCGCCG CACTGGCTTT TCGTGCAGGG CAGCGTCGAG
CGCCTTGCCG AGCCGTCCAT TGCGGTCGTC GGTACCCGCA AGCCCAGCGC CGACGGCTTC
TTCCTGTCAC GCTATGTGGG CGCTTGTCTC GGCGAATGGG GTGTACCGAC CGTCAGTGGC
CTCGCGGCCG GCATCGATCA GCTGGCGCAT GAACACTCGC TACGCGCTGG CGTGCCGACG
ATCGCGGTGC TGGGCACCGG CATGCTCGAA GACTATCCCA AGGGCTCAGG TCGACTGCGC
GATCATATTC TGGCGACCGG CGGCGCGATC GTCAGCGAGT ATCTACCAAC AGCGTCCTAC
AGCGCCGAGA ATTTCGTCCA GCGCAACCGG CTCCAGGCGG CGCTCGGCCG GATCCTGATC
CCAGCCGAAT GGAATCGCCG CAGCGGCACG GCCCATACGG TCCGCTTCGC GACCGCGCTT
GGGCGGCCTA TTGCCTGCCT GCGCTTGCCT GAGTGGCCGG ACGAGCGCGT AGTGCTGGAG
CGTGGCATGG GGCTTCCGAC CGGCGAAATC TTCACCGTAC CGCACGACCA GGGACGGTTC
GACGCCTTCG TCCGGTCGGC GATCGGCAAG TCTTCACCCG CTCAGTTGGG CCAACTTTCG
CTATTTGGGG ATAGCTAG
 
Protein sequence
MDRPSHDGPP DRISDVHAAT ALTPPSAGTI DEERAFLALA TVRGIGQKTL FAMADDKRSF 
SDALEYGPEA SNPAGSDGKA ISERHWSRVR GHALEQGDRL AEHLEALGIG LLFRGSPGFP
SALLDLERPP HWLFVQGSVE RLAEPSIAVV GTRKPSADGF FLSRYVGACL GEWGVPTVSG
LAAGIDQLAH EHSLRAGVPT IAVLGTGMLE DYPKGSGRLR DHILATGGAI VSEYLPTASY
SAENFVQRNR LQAALGRILI PAEWNRRSGT AHTVRFATAL GRPIACLRLP EWPDERVVLE
RGMGLPTGEI FTVPHDQGRF DAFVRSAIGK SSPAQLGQLS LFGDS