Gene Daro_3474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3474 
Symbol 
ID3567342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3722263 
End bp3723417 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content55% 
IMG OID637681946 
Productporin 
Protein accessionYP_286673 
Protein GI71909086 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.00203539 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.190966 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAGA AGATTATCGC TTTGGCCATC GCTGGTCTGG CTTCTACCGC TGCTTTCGCT 
CAATCCAACG TCACCATCTA CGGTGTTGCT GATGCCACCT TCGACAGCGT CAAGGCTACC
AAGGGCTCTT CCGCCTCTGA GCAAGCTGCT TTCGTTACTC GTCAGCGCGT TACTGCCAAC
TCTTCCTACA TCGGCTTCAA GGGTGTTGAA GATCTGGGCA ACGGCCTGAA GGCTGTCTTC
CAGTTCGAAA ATGGTATTAA CAACGACAAC AGCAGCGCTG GTTCTTGGAA CAACCGTGAC
TCCTACGTTG GTCTGTCTGG TGGTTTCGGT ACCGTTGTTG CTGGTAACCT GACTGGCCCG
ACCCGTGCAG TTGGCGCCAA GTACGACGTC AATCAAGGTG CAACCGGTAT CGGCGCTAAC
GCCGCTCTGC TCGGCAAGCT GGGTACCATC GCAGGTGATT CCGGCGCTTC CGCATTCGAT
CAGCGTATCT CCAACGCTGT TGCCTACGTC TCCCCGACCG TTGCTGGTTT CACGGGCGTG
ATCGGTTACT CCACCGGTCT GTCCAGCGCC GCTATCGCTG GTACTGCCGC TGCCGTGATT
GGTACGAACC GCGAAGCTAC CGGTGCAGGT GATGTTCAGT TCAACACCGC TCGCACTCTG
GGTCTCGGCT ACGCAAATGG CCCGATCTCG GTTGATTACG CCTACACCCG TGTTGGTCTG
AAGGATGCTC AAAATGACCT CCAAGACCAT CGTCTGGGCT TCCTGTTTAG CCAAGGTTGG
GGTTCTGTCG GTCTGCTGGC CGAGCGTACC TCCCTCCAGG CTACCACTGG CAACCTGACC
CAGAACGTGT TCTATGTTCC GGTTAAGGTT AATGTTGGCA AGGGCCGTGT CATTGGTCAA
TTTGGCCACG CTGGTAACGT GAAGAACACT GTTGCTTCCG AAGGTGCTAA CCACTACGTT
CTGGGTTACG AGCACGATCT GTCCAAGCGT ACCACCCTGA AGTTGGTCTA TTCCCAGATC
AACAACAAGG AAGGTTCGAA CTACGACTAC CTGTATGGCG CTGGCAATGC TAACAGCACG
GCTACCAACA CCTCTGGTGT TGCTAACGAT GCAAACGTCA AGGGTATTTC CCTGGGCCTG
CGTCACGCTT TCTAA
 
Protein sequence
MQKKIIALAI AGLASTAAFA QSNVTIYGVA DATFDSVKAT KGSSASEQAA FVTRQRVTAN 
SSYIGFKGVE DLGNGLKAVF QFENGINNDN SSAGSWNNRD SYVGLSGGFG TVVAGNLTGP
TRAVGAKYDV NQGATGIGAN AALLGKLGTI AGDSGASAFD QRISNAVAYV SPTVAGFTGV
IGYSTGLSSA AIAGTAAAVI GTNREATGAG DVQFNTARTL GLGYANGPIS VDYAYTRVGL
KDAQNDLQDH RLGFLFSQGW GSVGLLAERT SLQATTGNLT QNVFYVPVKV NVGKGRVIGQ
FGHAGNVKNT VASEGANHYV LGYEHDLSKR TTLKLVYSQI NNKEGSNYDY LYGAGNANST
ATNTSGVAND ANVKGISLGL RHAF