Gene Saro_2196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2196 
Symbol 
ID3918862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2334380 
End bp2336281 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content59% 
IMG OID640444951 
ProductRNA-directed DNA polymerase 
Protein accessionYP_497468 
Protein GI87200211 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.035125 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCCAG ATACCATTGA GAGGGCCGTC AGATCACTCC CGACCGTCAT CAGGTCGGGT 
CGGAAGGTCA ATGGCCTCTA TCGCCTGCTG AAAAGCCCGC TCCTCTGGGA GCATGCCTAC
CAGCGGATCG CCCCGAATAA AGGGGCGATG ACGCCCGGAG TTGATGGTCA GACATTCGAC
GGCTTCTCGC CCGACAAGGT CAGGTCCATC ATCGAACGGC TTGCGAACGG GACCTATCGA
CCTCAACCGG CAAGACGGGT GTATATCCCG AAAGCCAATG GCCAGAAACG GCCACTTGGC
GTGCCGACCA CGGAAGACAA GTTGGTCCAG GAAGTGGTGC GGACGATCCT CGAGCAAATC
TATGAGCCGC TGTTTTCCAG ACACTCTCAT GGGTTCCGCC CGAAACGTTC ATGCCACACG
GCGTTGGAAT CGATCCGCGC CATCTGGACC GGGGTGAAAT GGCTGATCGA CGTCGATGTT
GTCGGGTTCT TCGACAACAT CGATCATGAT GTCCTCGTAT CCCTGCTCGA AAAACGGATT
GCGGATCGTC GCTTCGTGCG GCTCATCCGG GGTCTACTCA AGGCCGGGTA TGTCGAAGAC
TGGGTCTTTC ACAAGACCTA CAGCGGAACG CCCCAAGGCG GGGTCGTCTC CCCGATGCTG
GCGAACATCT ACCTGCATGA ACTGGATATG TTCATGCAGG CCAAGATGGC TGGCTTCGAC
AAAGGGAAGC AGCGATCACC ATCCCCTGAT GCCCGGCGCA TCAGGAACCG CCTGTCCTAT
GTCCGCCGCA CAGTGGATCA ACTTCGCGCC AAAGGGCGCG GCGACGATCC CAGAGTCACT
TCCTTCTTGG AAGAGATCGG TCGGCTCAAG GCGGAACGGC TTGCCGTTCC GGCCAGCGAC
GCCTTTGATC CGAACTACAG GAGACTGCGC TACTGCCGAT ACGCCGACGA CTTCATCATC
GGCGTCACTG GTAGCAAATC GGAAGCACGA CAAATCATGG AGGAGGTCAG GACCTACCTG
TCCGATCACC TGAAGTTGGC CGTGTCCGCC GAGAAAAGCG GAATCCACAA GGCTTCGGAT
GGGGCACGGT TCCTCGGATA CGAAGTCCGG ACCATGACCA ATCCCAATCC GCACAAGGCG
ATCTTCGATG GTCGTCCGGC AGTGCGGCGA GGCTTGGCCG ACCGGATGAA ACTTCTGGTG
CCAAGGGACC GCGTCGTGCG GTTCGTCAAC TCAAAGGAAT GGGGCGACTA CGACAGCTTC
AGACCTGTTG GGAGGGCGGC CTTGCGCTTC GCAAGCGATG TGGAAATCGT CCTAGCCTAT
AACGCCGAAT GGCGCGGCTT TGCGAACTAT TATGCCATTG CTGACGACGT GAAACGCAAG
CTGAACAAGG CAGGTTACTT CGCCTTGCTC TCATGCGTGA AAACCATCGC CGGAAAGCAC
AGGACATCGG CTCGCAGAGT CTTCGCCAAA CTGCGTCGCG GTACGGACTT CTATATCAGC
TACGAGGTGG GAGACACCAC CCGGACAATA AAACTGTGGC AGTTGAAGGA CCTCCAGCGA
CACACGCGCA CCTGGGGCGG GATCGATATC CCTTCCTCAG CAAAGTTCGT GTTCAGCAGG
ACAGAGCTGG TCGAGCGGCT CAACGCCCGC GAATGCGAGC GTTGCGGTAG CAATGACCAA
CCTTGTGAGG TTCATCACGT CCGCAGGATC GGCGAATTGC AGCATGCCGG GTTCAGTCGC
CATATGGCGG CCGCCCGTCA GCGTAAACGC ATGGTCTTGT GCTCCCGCTG CCACAACGAT
GTCCATGCCG GACAGCCGAC CGACCGCCAA CGGCGGACAG CACGCAGTCG TGGAGAGCCG
AATGCGCTGA AAGGTGCACG TTCGGTTCGG AGGGGGGCCT AG
 
Protein sequence
MLPDTIERAV RSLPTVIRSG RKVNGLYRLL KSPLLWEHAY QRIAPNKGAM TPGVDGQTFD 
GFSPDKVRSI IERLANGTYR PQPARRVYIP KANGQKRPLG VPTTEDKLVQ EVVRTILEQI
YEPLFSRHSH GFRPKRSCHT ALESIRAIWT GVKWLIDVDV VGFFDNIDHD VLVSLLEKRI
ADRRFVRLIR GLLKAGYVED WVFHKTYSGT PQGGVVSPML ANIYLHELDM FMQAKMAGFD
KGKQRSPSPD ARRIRNRLSY VRRTVDQLRA KGRGDDPRVT SFLEEIGRLK AERLAVPASD
AFDPNYRRLR YCRYADDFII GVTGSKSEAR QIMEEVRTYL SDHLKLAVSA EKSGIHKASD
GARFLGYEVR TMTNPNPHKA IFDGRPAVRR GLADRMKLLV PRDRVVRFVN SKEWGDYDSF
RPVGRAALRF ASDVEIVLAY NAEWRGFANY YAIADDVKRK LNKAGYFALL SCVKTIAGKH
RTSARRVFAK LRRGTDFYIS YEVGDTTRTI KLWQLKDLQR HTRTWGGIDI PSSAKFVFSR
TELVERLNAR ECERCGSNDQ PCEVHHVRRI GELQHAGFSR HMAAARQRKR MVLCSRCHND
VHAGQPTDRQ RRTARSRGEP NALKGARSVR RGA