Gene Daro_3189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3189 
Symbol 
ID3566859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3425846 
End bp3427111 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content60% 
IMG OID637681660 
Producthypothetical protein 
Protein accessionYP_286389 
Protein GI71908802 
COG category[S] Function unknown 
COG ID[COG1432] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00288] conserved hypothetical protein TIGR00288 


Plasmid Coverage information

Num covering plasmid clones56 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCAG CCAACGACAA CGCCAGCATG GCGCTCTTTT GCGACTTCGA GAATATCGCT 
CTCGGCGTGC GCGATGCCCA GTATGAAAAA TTCGATATTC GTCCGGTGCT TGAGCGTCTG
CTCGCCAAGG GCAGCATCGT CGTCAAGAAG GCGTATTGCG ACTGGGATCG CTACAAGGCT
TTCAAGGCGG CCATGCACGA AGCCAATTTC GAGCTGATCG AAATTCCGCA TGTCCGCCAG
TCCGGCAAGA ACTCAGCCGA CATCCGCATG GTCGTCGATG CGCTCGACCT CTGCTACACC
AAAGCCCATG TCGATACCTT CGTGATCATT TCCGGCGACT CCGATTTTTC CCCGCTGGTC
TCCAAGCTGC GTGAAAACGC CAAGCGAGTG ATCGGCGTTG GCGTCAAGCA AAGCTGCTCC
GACTTGCTGG TGACCAATTG CGACGAATTC ATCTATTACG ATGATCTGGT TCGCGATCGC
GAGGCCGGGC GCGGGCCACA ACAACGCCGC GAGCGGGAAA AGCGTTCGCC GGAAGAAGAG
GCCAAGCGCC GCGACAAGCA GGAAGAGCGC AAGAGCAAGG CAATCGATAT CGTTTCCGCT
ACCTTTGTCG ACCTGATGGC CGACCGGGGC GAAAGCGAGC GCATCTGGGC TTCGGTGCTC
AAGGAGGTCG TCAAGCGGCG TAATCCGGGC TTCAATGAGA GCTATTACGG CTTCCGGACC
TTTGGTAACC TGCTGGAAGA GGCGGCTGGT CGTGGCCTTA TTGGCTTTGG CCGCGATGAC
AAGGGAGCTT TCGTTTTCCG TGCCCCGCCG AAATCCACGG CGACTCAAGA GGCCGAGATT
GTCGGCCAGG CACTCGTCGT CAATGCAGCG GCGGTAGAAG TCACTGCCGA AGTCGAAAGC
AAACCGGAAG CGCCGATGCT TGAGCCGGAA TCGAGCGCCA ATCCACGGCG ACGTGGTGGG
CGCCGTAGCC GTAACGGGCG TGATAAGGAG CGCTCACCGT CTGTCGAGGC GTTTGTCGAT
TCAACTCCCC CGGTCGAAAT TGCGCCTGTT GTGGAGGCTC CTGTAGTGCT TGAGCCGGCG
GCTCCTGCCG ACACCAAGCC TCGTCGCGGT GCCCGTCGTC CACGTAATAC CGAGGCGCTT
GCGCCTGCCT CGGCTGATGT CGAGGCCTCT TTGCCAGAAG TTCCTGTCGT CGAGTCGAAG
CCGGCCAAGC CCAAACGGCC GGCGCGGCCG CGCAAACCGA AGGTCGTGGA GCCTTCGGAC
GCTTGA
 
Protein sequence
MAAANDNASM ALFCDFENIA LGVRDAQYEK FDIRPVLERL LAKGSIVVKK AYCDWDRYKA 
FKAAMHEANF ELIEIPHVRQ SGKNSADIRM VVDALDLCYT KAHVDTFVII SGDSDFSPLV
SKLRENAKRV IGVGVKQSCS DLLVTNCDEF IYYDDLVRDR EAGRGPQQRR EREKRSPEEE
AKRRDKQEER KSKAIDIVSA TFVDLMADRG ESERIWASVL KEVVKRRNPG FNESYYGFRT
FGNLLEEAAG RGLIGFGRDD KGAFVFRAPP KSTATQEAEI VGQALVVNAA AVEVTAEVES
KPEAPMLEPE SSANPRRRGG RRSRNGRDKE RSPSVEAFVD STPPVEIAPV VEAPVVLEPA
APADTKPRRG ARRPRNTEAL APASADVEAS LPEVPVVESK PAKPKRPARP RKPKVVEPSD
A