Gene Daro_4114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4114 
Symbol 
ID3566692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4411943 
End bp4413343 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content59% 
IMG OID637682586 
ProductF0F1 ATP synthase subunit beta 
Protein accessionYP_287310 
Protein GI71909723 
COG category[C] Energy production and conversion 
COG ID[COG0055] F0F1-type ATP synthase, beta subunit 
TIGRFAM ID[TIGR01039] ATP synthase, F1 beta subunit 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value0.181857 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAG GTTCAATCGT TCAGTGCATC GGCGCCGTTG TGGACATCCA CTTCCCGCGC 
GACGCGATGC CGAAGGTCTA CGACGCTCTC AAGCTGGATG CTTCCGAAGC AAACGGCATG
GCAGAAGATG GCCTGACCTT CGAAGTGCAA CAACAACTGG GTGACGGCGT CGTCCGCACC
ATCGCCATGG GTTCGTCCGA CGGTCTGCGT CGTGGCATGA AGGTGAACAA CACCGGCGCC
GGCATTTCCG TGCCTGTCGG TATGGGCACC CTGGGCCGCA TCATGGATGT GCTGGGTCGC
CCGATCGACG AAGCCGGTCC GATCGATTCC ACCGAGCTGC GCGTTATTCA CCAGCCGGCT
CCGAAGTTTG ACGAACTGTC GTCTTCCGTC GATCTGCTGG AAACCGGCAT CAAGGTTATC
GATCTGATCT GCCCGTTCGC CAAGGGCGGC AAGGTTGGCC TGTTCGGTGG CGCCGGCGTC
GGCAAGACCG TTAACATGAT GGAACTGATC AACAACATCG CGAAGCAGCA CGCCGGCTTG
TCCGTGTTTG CTGGCGTGGG CGAGCGTACC CGTGAAGGTA ACGACTTCTA CCACGAAATG
AAGGACTCCA ACGTTCTCGA CAAGGTCGCG ATGGTGTTCG GTCAGATGAA CGAGCCGCCG
GGCAACCGTC TGCGCGTCGC GCTGACCGGC CTGACCATGG CTGAGCGTTT CCGTGACGAC
GGTCGTGACA TTCTGTTCTT CGTCGACAAC ATTTACCGCT ACACGCTGGC TGGTACGGAA
GTTTCCGCAC TGCTTGGCCG TATGCCTTCC GCCGTGGGCT ATCAGCCGAC GCTGGCTGAA
GAAATGGGTC GTCTGCAAGA GCGTATCACC TCGACCAAGG TTGGTTCGAT CACCTCCATC
CAGGCCGTTT ACGTGCCTGC CGATGACTTG ACCGACCCGT CCCCGGCTAC CACCTTCCTG
CACTTGGACT CCACGGTTGT GTTGTCGCGT GACATCGCCT CGCTGGGTAT CTACCCGGCT
GTCGATCCGC TTGACTCCAC TTCCCGCCAG CTCGACCCGC AGGTCGTTGG TGAAGAGCAT
TACTCCGTTG CCCGTGCCGT GCAGATGAAT CTGCAGCGCT ACAAGGAACT GCGTGACATC
ATCGCGATTC TGGGTATGGA CGAACTGTCT CCGGAAGACA AGCTGGCCGT GTCCCGCGCC
CGCAAGATTC AGCGCTTCCT GTCGCAGCCG TTCCACGTGG CTGAAGTCTT CACCGGCTCG
CCGGGCAAGT TCGTTTCCCT GAAAGAAACG ATCAAGGGCT TCAAGGGCAT TTGTGCCGGC
GAATACGATC ACCTGCCGGA ACAAGCGTTC TACATGGTGG GCGGTATCGA GGAAGTCATC
GAGAAGGCCA AGACGCTGTA A
 
Protein sequence
MSQGSIVQCI GAVVDIHFPR DAMPKVYDAL KLDASEANGM AEDGLTFEVQ QQLGDGVVRT 
IAMGSSDGLR RGMKVNNTGA GISVPVGMGT LGRIMDVLGR PIDEAGPIDS TELRVIHQPA
PKFDELSSSV DLLETGIKVI DLICPFAKGG KVGLFGGAGV GKTVNMMELI NNIAKQHAGL
SVFAGVGERT REGNDFYHEM KDSNVLDKVA MVFGQMNEPP GNRLRVALTG LTMAERFRDD
GRDILFFVDN IYRYTLAGTE VSALLGRMPS AVGYQPTLAE EMGRLQERIT STKVGSITSI
QAVYVPADDL TDPSPATTFL HLDSTVVLSR DIASLGIYPA VDPLDSTSRQ LDPQVVGEEH
YSVARAVQMN LQRYKELRDI IAILGMDELS PEDKLAVSRA RKIQRFLSQP FHVAEVFTGS
PGKFVSLKET IKGFKGICAG EYDHLPEQAF YMVGGIEEVI EKAKTL