Gene Ava_C0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0033 
Symbol 
ID3678109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp51340 
End bp53223 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content42% 
IMG OID637715117 
ProductRNA-directed DNA polymerase 
Protein accessionYP_320311 
Protein GI75812694 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.590524 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTCAAA AGCCAAAGCC CAGAGTAATG CCTGGGATAT GCAGACAGAC ACACGACATT 
TTGGAAGGTA TAGTCTCCAC TAGCAGTAAT ATGTCTAAAG CCGAACCACA GTTCAGGATG
GAATCAGAAC AAGACAAACC GCACAAACCC CGAATTGACC CGACCATTCA ATGGCAGTCC
ATTCCCTGGA AGAAGCTGGA ACGTCGAGTT TACAAGCTGC AAAAAAGAAT ATACCAAGCC
GCGAATCGTG AAGATGTCAC TACAGTTCGC CAGCTTCAGA AAACCCTACT CAAGTCCTGG
TCAGCAAAAT GTATTGCGGT CAGAAAGGTA ACTCAGGATA ATCAAGGGAA AAAGACGGCT
GGGGTAGATG GTGTCAAACT ATTAACACCC CACCAACGTT TGCAACTGGT TAAACGTCTC
AAACTCTCTT CCAAAGCAAA TGCTGTGCGA AGAGTCTGGA TACCAAAGCC TCTAACCCAA
GAAAAAAGAC CTCTGGGTAT TCCCACAATG TATGACCGTG CATTGCAAGC CTTAGTCAAA
ATGGCACTAG AACCCGAATG GGAAGCACGT TTTGAACCTA ATTCTTATGG CTTTCGACCA
GGGCGTTCGG TGCATGATGC AATTAGTGCA ATTTATCTTA ATATCAAGCA AAAGGCAAAG
TATGTGCTAG ATGCTGATAT TTGCAAGTGC TTTGAACGCA TCAACCATGA AGCATTATTG
ACAAAACTGA ACACATTTCC CAGCCTACGC CGTCAAATCA AAGCATGGCT CAGGGCTGGG
GTGTTAGATG GAGATAGTTT ATTTCCTAAC CTGGAAGGGA CACCACAAGG AGGCGTTATA
TCTCCTTTGC TGGCAAATAT TGCCTTACAT GGGTTAGAAA ATCAGATTAA GATGGCATTT
CCGCGAATAG ACCGCAAAAT TAATGGTAAG AAGCGGACAA TTCGACCACC AGCCTTAATC
AGATATGCAG ATGACTTTGT GGTCATCCAT GAAGACCTCT CCATAGTTCA AAAATGTCAA
ACTCTAATTG CTGATTGGTT AAAAGGCATC GGACTAGAGT TAAAACCAAG TAAAACATCT
TTAACTCATA CTTTCCAATC TTATGAAGGA AAATTGGGTT TTGACTTTCT CGGATTTCAT
ATTAGGCAAT ATCCCGTAGG AAACTACCTA AGTGCCAAAA ATTCGTATGG TCAGTTACTT
GGATTTAAAA CCCTGATTAC TCCAAGCAAG GATAAGTTGA AACAGCATTT GCTTCACATC
GCCCGAATAA TTGACACCCA TGCTCTCTCT CCACAAGAAA CTCTCATCAG TAAATTAAAT
CCTGTGATCA AAGGGTGGGC AAACTTTTAT TCTATCGGTG TCAGTAGCAG AGCTTTTTCA
AAAGCGGACT TTTTAACTTA TCAAAAACTG CGTGCTTGGG CTACTAATAG ATGTACTCAC
AGTAGTAAGC ACGAAATTGC TAACAAATAC TGGAGAGTAG CTCAAACAAA TAAATGGTGT
TTTAGTACAC CTGATGGTCA TCAGCTTATT AAGCACAGTG ATACCAAAAT TATCAGACAC
GTCAAAGTGA AAGATAATCG CAGTCCCTAC GATGGAGATT GGGTTTACTG GAGTTGCAGA
ATGGGTCATC ACCCAGAAGC GCCTACAACA GTTGCAACTT TATTAAAAGA GCAACAAGGT
AAATGCGCTC ATTGCGGACT CTTTTTCCGT GATGGAGATT TGATGGAAGT TGACCATGTT
ATTCCTAAAT CTAGAAAAGG GAAAAACTCT TACAACAACC TCCAATTACT CCACCGCCAT
TGTCACGATA CTAAAACTGT TACAGATGGG TCAATCTCAC GTTCAGTTTT AACACAGGAT
TATCTAGAGG CTCATCCCTT TTAA
 
Protein sequence
MCQKPKPRVM PGICRQTHDI LEGIVSTSSN MSKAEPQFRM ESEQDKPHKP RIDPTIQWQS 
IPWKKLERRV YKLQKRIYQA ANREDVTTVR QLQKTLLKSW SAKCIAVRKV TQDNQGKKTA
GVDGVKLLTP HQRLQLVKRL KLSSKANAVR RVWIPKPLTQ EKRPLGIPTM YDRALQALVK
MALEPEWEAR FEPNSYGFRP GRSVHDAISA IYLNIKQKAK YVLDADICKC FERINHEALL
TKLNTFPSLR RQIKAWLRAG VLDGDSLFPN LEGTPQGGVI SPLLANIALH GLENQIKMAF
PRIDRKINGK KRTIRPPALI RYADDFVVIH EDLSIVQKCQ TLIADWLKGI GLELKPSKTS
LTHTFQSYEG KLGFDFLGFH IRQYPVGNYL SAKNSYGQLL GFKTLITPSK DKLKQHLLHI
ARIIDTHALS PQETLISKLN PVIKGWANFY SIGVSSRAFS KADFLTYQKL RAWATNRCTH
SSKHEIANKY WRVAQTNKWC FSTPDGHQLI KHSDTKIIRH VKVKDNRSPY DGDWVYWSCR
MGHHPEAPTT VATLLKEQQG KCAHCGLFFR DGDLMEVDHV IPKSRKGKNS YNNLQLLHRH
CHDTKTVTDG SISRSVLTQD YLEAHPF