Gene Saro_3937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3937 
Symbol 
ID5077421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp111024 
End bp112865 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content58% 
IMG OID640481043 
ProductRNA-directed DNA polymerase (Reverse transcriptase) 
Protein accessionYP_001165705 
Protein GI146275544 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTACCAG ATACTGTGAT GGCCCGGTTG GAGGCCATTC CGACGATCTC TGCGTCGGGC 
AAACGGGTAA ATGGGCTTCA TCGTTTGATG AGGTCCCCGC TTCTTTGGGA GCAGGGACTT
CGGAAAATTG CCTCCAATCG GGGGGCGATG ACGCCGGGCA TCGATGGCAA GACATTCGAG
GATTTCGGTC CCGACCGTCT CGCTCCGTTG ATCGCCAGCG TTGCGACCGG AGCCTACAAA
CCAAAACCTG TGCGTCGGGT GTTCATCCCG AAAGGCAAAG GAAAGCGGCG TCCGCTGGGG
ATTCCCACGC GAGACGACCG CCTCGTCCAG GAAGTGGCAC GCCAACTGCT GGAACGAATC
TATGAGCCGG TCTTCTCGAA GGCCTCGCAT GGATTTCGAC CGGGAAGATC GTGTCATACG
GCCCTCGAGC ACGTGAAGGC TGTCTGGACG GGCGTCAAAT GGCTTGTCGA CGTGGATGTC
GCCGGGTTCT TCGAGAACAT CGACCATGAC ATTCTGCTGA AGCTGCTCCG GAAAAGGATC
GATGACGAAA GGTTCATCGA CCTGATCCGC GACATGCTGA AGGCAGGAGT CATGGAGGGA
AGGGCTCACA CCCAGACCTA TAGCGGCACA CCACAAGGCG GGATCGTCTC CCCGATCCTG
GCCAACATCT ACCTGCACGA ACTCGATGAG TTCATGGCGG GTCGGATCAC GGCCTTTGAA
AAAGGGAAGA CCCGCGCCAC GAACCCGGAA TACCGGAGAC TGGCGGGCCG GATCGCCAAA
CGGCGAGAAC GGCTCAAACG ACTGGAAGCC AGTGACAACG CTGATCAGGT AACGGTGAAG
GCCATCTTGG CCGAAATCAA CACCTTATCA AAGCAGATGC GTTCGTTGCC GTCGAGAGAC
GCCATGGACG CCGGGTTTCG CCGACTTCGC TACTGCCGTT ACGCCGACGA TTTTCTTATC
GGTGTGATTG GCAGCAAGGA CGATGCGAGA GGGGTCTTCG CCGAAGTCAG GACCTTCCTG
ACCGAGGTAC TGGCCTTGAC CGTATCCGAG GAGAAGAGCG GAATTCGAAA AGCAAGCGAT
GGTACCAAAT TCCTCGGATA CGAGGTGCGG ACTTACACGG GACGCCAATG GACAGTGCGA
AGCCAGAACG GCACACAGCA CTTCAAGCGG CGCCCGCCAT CGGAAGTCAT GCAACTCAAT
GTTCCGTGGG ATAGGGTCAC TGCGTTTGTT GCCCGGAAGG CATACGGAGA ATGGTCCCGA
TTGAGGGCCA AACACCGCAA CCACCTTCTA AGCTGTAGCG ATGTCGAGAT TGTCCTTGCC
TACAACGCCG AACTGCGAGG GTTCGCGAAC TACTACGCTC TGGCGCGCGA TGTGAAATTC
AAGCTCAACC GGCTTGAATA CCTTCAGCGC TGGAGCATGT TCAAAACCTT GGCAAGCAAG
CACAAATCCA GTGTGCGAGT TGTTGCCGCC CGCATGAGGC AAGGGCTGGA ATACCTCGCC
GGCTATGAAG TCGGCGGCCA GCCCCGATCA GTCAAAGTCT GGAAAATGAC CGATCTGAAC
CGTGACCGGA TAGACCCGGA CAAGGTGGAC GTCCAACCTT GGACGCAAAT CTTCTCCGGC
TCGCGAACAG ATTGGGTCGA CCGGCAGAAC GCCACGCAAT GCGAAGCGTG CGGCCGATCC
GACCTCCCCT GCCATGTTCA TCATGTCAGG GGAATGGCCG ATGTTGCGCA CAGAGACCAA
GCCACGAGGA AAGCCATAGC CAGAGCGCGC AAGACGAAGG TTCTGTGCGT CCCTTGCCAC
AAGGCGATCC ATGGTGGCCC ACTACCGGAG CAGAGAACAT GA
 
Protein sequence
MLPDTVMARL EAIPTISASG KRVNGLHRLM RSPLLWEQGL RKIASNRGAM TPGIDGKTFE 
DFGPDRLAPL IASVATGAYK PKPVRRVFIP KGKGKRRPLG IPTRDDRLVQ EVARQLLERI
YEPVFSKASH GFRPGRSCHT ALEHVKAVWT GVKWLVDVDV AGFFENIDHD ILLKLLRKRI
DDERFIDLIR DMLKAGVMEG RAHTQTYSGT PQGGIVSPIL ANIYLHELDE FMAGRITAFE
KGKTRATNPE YRRLAGRIAK RRERLKRLEA SDNADQVTVK AILAEINTLS KQMRSLPSRD
AMDAGFRRLR YCRYADDFLI GVIGSKDDAR GVFAEVRTFL TEVLALTVSE EKSGIRKASD
GTKFLGYEVR TYTGRQWTVR SQNGTQHFKR RPPSEVMQLN VPWDRVTAFV ARKAYGEWSR
LRAKHRNHLL SCSDVEIVLA YNAELRGFAN YYALARDVKF KLNRLEYLQR WSMFKTLASK
HKSSVRVVAA RMRQGLEYLA GYEVGGQPRS VKVWKMTDLN RDRIDPDKVD VQPWTQIFSG
SRTDWVDRQN ATQCEACGRS DLPCHVHHVR GMADVAHRDQ ATRKAIARAR KTKVLCVPCH
KAIHGGPLPE QRT