Gene Saro_2211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2211 
Symbol 
ID3916527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2354601 
End bp2355872 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content58% 
IMG OID640444966 
Productphage integrase 
Protein accessionYP_497483 
Protein GI87200226 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.684899 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAACA AAGTAGGTTT GACGGATGCA AAGATCGCAG GGCTGAAGGC GCCTGCTGCG 
GGTCAGATTG AGCTCGCCGA TGGCATCGTA CCCGGTCTTA GGCTCCGCAT GGGCGCCAGT
GGTATCAAGA CGTACATTCT GCGCAAGCGG GTTCAGGGCA AATGGCTGAA TGTGACAATC
GGACGTCATG GGCCAAGCTT TACACTGGCC CATGCTCGCA AGAAGGCGCG CGATCTGCTC
GTGGATGTCG AGCAGGGCAA GAGCATCGCT AGAAAAGCGG GCGCAAAGAG GAAGGGATCA
AAAGGCGTCG GGACAGTCGC GGAGCTCTAC GAGACGTACC TCGCTCAGCA GATCGAGGGC
AAAAAGCGGA GCGCGAGGGA GTTCGATCGA GTGTTCCGAA AATACATCGA GCCCGAGATT
GGAGACCGGC TCGCGGACTC CATAACCCGA AGTGACGTGA GCCGCTTCGT TGAGAAAATT
GCGTTCGAGC GGGGCAAGGA AACCCTGACG ATGGCGCGGA TCGTGTATCG CCACCTTTCA
ACTTTCTACA CTTGGGCCCT CACCAGATTG GAGCATATGC CCGCCAATCC TTGCCGGGAT
GCATGGCGGC CGAAGCGGAA TGAGCCTCGC GACCGTGTAC TCAGCGACCG TGAGCTTGCT
GCGCTGTGGC AATCTGCGGT CGAGGATGGC TATCCGTTCG GCCATCTCGT GCAGATGCTA
ATCCTTACGG CTCAACGCAG AGGTGAAGTG CTTGATGCCG CGTGCGACGA GTTCGACTTC
AAAGCCAAGG TCTGGACCGT CCCAGGTGAT CGAGCAAAAA ACGGCAAGGC CAATGTGGTG
CCAATATCCG CTCAGGCTCT TGGAGTGATC ACCGAGATAT TCCGGGCTGC CGGGATTGAC
CCCGATGACG CCCACAAACA ATCCCAGATC CTGTTGGCTT CGAAGGTCAC CAATACGAAC
AGTGTCAGCG GACTATCCAA GGCCTGGAAG CGGATCAGAG CAAGCGTGGA CGAGAAGCTC
GGCTACGAAG CCGGACACTT CACAATGCAT GATATTCGAC GGACGGTGGC GACCGGCCTC
CAGCGTCTTG GCATACCGTT GGTGGTTTCT GAGGCGGTCC TGAATCACCA ATCAGGATCA
GCTATGGCTG GCGTCGCTGG GGTCTATCAC CGTCACCAGT ATACCAACGA GAAACGGGAG
GCGTTGGCGC TGTGGGGTAA GGAGGTAATG CAGATTGCAG CCAAGTACCC AGTCGAAGTC
AAAGACGGCT GA
 
Protein sequence
MANKVGLTDA KIAGLKAPAA GQIELADGIV PGLRLRMGAS GIKTYILRKR VQGKWLNVTI 
GRHGPSFTLA HARKKARDLL VDVEQGKSIA RKAGAKRKGS KGVGTVAELY ETYLAQQIEG
KKRSAREFDR VFRKYIEPEI GDRLADSITR SDVSRFVEKI AFERGKETLT MARIVYRHLS
TFYTWALTRL EHMPANPCRD AWRPKRNEPR DRVLSDRELA ALWQSAVEDG YPFGHLVQML
ILTAQRRGEV LDAACDEFDF KAKVWTVPGD RAKNGKANVV PISAQALGVI TEIFRAAGID
PDDAHKQSQI LLASKVTNTN SVSGLSKAWK RIRASVDEKL GYEAGHFTMH DIRRTVATGL
QRLGIPLVVS EAVLNHQSGS AMAGVAGVYH RHQYTNEKRE ALALWGKEVM QIAAKYPVEV
KDG