Gene Daro_3570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3570 
Symbol 
ID3566382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3824057 
End bp3825352 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content43% 
IMG OID637682043 
Productankyrin 
Protein accessionYP_286769 
Protein GI71909182 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value0.85862 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTATT CATCGAATCG ACTGTATCGC TCAATCTATA TCCTGAGTAC TTTACTAGTC 
ATTGCGCTGT TATCGCCCGC AAAAGCTGAA AGTTTCCTGT GGGATGCAGT TTCGAAAAAT
GACGTCGATA AAGCTAAAAC ACTCATTTCT TTTGGTGCAA ATGTCAATCA AAAAAATGCT
TACGGTAACC CAATAATTCA CCATGCTGTA GCAGAAGGAA ATCTAGAGAT AGTCGAATTA
TTGATCTCCA AAGGTGCAGA CGTAAATGCA AAAGGTCAAT TTGACCGGGT GGCGCTTCAT
TACGCAAACA AGAAAGGCAT GGCCAAAACC CTTCTGGCGC ACCGCGCGAT TGTAGATACA
CCTACCAATT ACGGAGAAAC ACCACTGCAC TGGGCAGCAA GTGGGGTGAA TGGATTCGGA
AAACAAGTCG ATTTGGTTGA GTTTGCTGAG GTTTTAATTG CAAATGGAGC CGACGTAAAC
AAAAAAACAG GTGAAGGTAG GTCAAATAAA ACACCGCTCA ATTATGCGGC AGAGTCAAAC
AATCTGCCTG TTGCTAAGAT CCTTATCGCT CACGGTGCAG ACGTTGATGG TGGCGGTTCT
TCACCATTAA GTTCTGCAGG GGGAAATGGT GATTATGTCG AGATGGCTCA ACTACTTGTA
GAGCATGGCG CTGGAGTAAA CACCCCTTCA ATAGGAGGTT GGTATCCACT TCATTCTGCT
GCGGGGAGAG GAAATATTAA TGTAACTAAT TATTTACTGG CACACGGAGC AGACCCGAAT
GCCACAACCA CAAACAGGGA TAAATACACT GCTCTTTATG TGGCATCAGG AAGTGATTAT
CACGCGAAGG TCGTAGAGTC TTTGTTAAAA AGCGGGGCCA ACCCAAACAT CAGGATTGCC
AATAGCTTAG TTCCACTTCA TATGGCGACA TCCGAAGGTG CAATAAAGAC TGTTGAAGTA
TTACTAGATC ACAAGGCGGA GATAAATATT GCCACTAGCG ATGGCACAAC GCCGCTTCAT
CTGGGTATCA CTCTTGGAAA AAATGATAAT AGAAAAGATG TCGTTGCGCT TTTATTAAAG
AACGGCGCAA ATGTTAATAC ATTAAACATT CGCAGCGGGA TGACGCCGCT AACAGAAGCG
ATAAATCGAA ATGATGTTGA TATAGTAAAA CTACTAATAT CAAATGGCGC AGATTTAAAT
ATAATGGGAA TTGTCGGCAA TAAAGCTCTT GCCGCAGCAC GAAATTCCAT TGCTATTACA
GATTTGCTGA AAGAGCACGG TGCGCATCAA CCTTAA
 
Protein sequence
MSYSSNRLYR SIYILSTLLV IALLSPAKAE SFLWDAVSKN DVDKAKTLIS FGANVNQKNA 
YGNPIIHHAV AEGNLEIVEL LISKGADVNA KGQFDRVALH YANKKGMAKT LLAHRAIVDT
PTNYGETPLH WAASGVNGFG KQVDLVEFAE VLIANGADVN KKTGEGRSNK TPLNYAAESN
NLPVAKILIA HGADVDGGGS SPLSSAGGNG DYVEMAQLLV EHGAGVNTPS IGGWYPLHSA
AGRGNINVTN YLLAHGADPN ATTTNRDKYT ALYVASGSDY HAKVVESLLK SGANPNIRIA
NSLVPLHMAT SEGAIKTVEV LLDHKAEINI ATSDGTTPLH LGITLGKNDN RKDVVALLLK
NGANVNTLNI RSGMTPLTEA INRNDVDIVK LLISNGADLN IMGIVGNKAL AAARNSIAIT
DLLKEHGAHQ P