Gene Daro_0639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0639 
Symbol 
ID3568906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp701114 
End bp702829 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content56% 
IMG OID637679082 
Producttype II secretion system protein E 
Protein accessionYP_283866 
Protein GI71906279 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02538] type IV-A pilus assembly ATPase PilB 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCAA CCCCTCAAAA TCCTCCACTT AGCGGCTTGG CGCGCGCGCT CGTCCAGGCC 
GGACATCTCA AGGAGGTAGA GGCGGAACAA TTGCTGGCCC AGGCCCACAG CACCAAGACT
TCGCTGATCG AACAGATCAT TACAAGCCAG AAATCCAGTG CAATCGACAT AGCGCGCTTT
GTCGCCGATA CCTTTGGCTA TCCACTACTT GACCTCAACG CTTTCGATGA AGCCCACATT
CCGTCGGATG CCATCGACCG TAAGCTGATT GCGACGCACA AGGTCATTCC ACTCAACAAG
CGCGGAAACC GTTTATCGGT AGCAATTGCT GACCCAACCA ACCTTCGTGC GCTGGACGAA
ATTCGCTTCC AGACCGGTCT GGCTGTCGAT CCGATTGTCG TCGAACACCC AAAACTGGCG
CCGCTCGTCA ACAAATATGC CGAGACAGCC GCCGAAGCGC TGAAAAACTT CACCAGCGAG
GATCTCAACC TCGATTTTCT GGACGAAGAA ACCTCCAGCA AAGCTGACGA AGCCGCAGGG
CAGGAAATCG ATGACGCACC GGTCGTCAAA TTCATCCAAA AAATGCTGCT CGATGCCATC
AATGATGGAG CATCGGACAT CCATTTCGAA CCATACGAAA AGTTTTATCG CATTCGCTTC
CGCGTCGACG GCATCCTGCG CGAAGTAGCC ACTCCACCGC TGGCCATCAA GGAAAAAATT
GCCTCGCGCA TCAAGGTCAT TTCCCGGCTT AATATCGCCG AAAAGCGCGT CCCGCAGGAC
GGCCGGATGA AACTGGTGCT TTCGAAGACC CGTGCCATCG ATTTCCGGGT CAGCACACTA
CCGACGCTTC AGGGCGAGAA AATCGTTATG CGTATTCTCG ACCCAAGCTC AGCCACCTTG
GGCATCGAGG CGCTAGGCTA CGAGCCGGAG CAAAAAGCAG CAATAATGGA CGCCATCAGC
CGCCCCTATG GGATGGTACT GGTCACCGGA CCGACAGGTT CTGGCAAAAC CGTTTCGCTC
TACACCTGCC TCAACATCCT GAATAAGGAT GGCATCAACA TTTCGACAGC GGAAGACCCG
GCTGAAATCA ATCTACCGGG TGTCAATCAA GTCAACGTCG ACGACCGTGC TGGCCTGACC
TTCCCTGTCG CACTAAAGGC ATTCCTGCGC CAGGATCCGG ACATCATCAT GGTCGGCGAA
ATTCGTGACC TGGAGACCGC CGAGATTTCC ATCAAGGCGG CACAAACGGG TCACCTGGTG
CTCTCGACGC TGCACACCAA CGATGCCCCG CAAACACTGA CTCGACTGAT GAACATGGGC
GTCCCCATGT TCAATATCGC CTCCAGCGTG CTGTTGATCA CCGCTCAGCG CTTGGCGCGA
CGGCTATGCA ACTGCAAGAA ACCGATGACC GTACCCGACC AGGCACTACT GGATGCAGGC
TACTCGGAAG CCGATCTCGA CGGTTCGTGG ACGCTGTTTG GCCCAGGAGG ATGTGAACGG
TGCAAAGGGA CCGGCTACAA GGGACGGGTC GGCATTTATC AGGTCATGCC CATTTCCGAA
GCTATGCAAC GCATGATCAT GAGCGGCGCA TCTGCACTGG ACTTGGGCGC CCAAGCGAAA
GCCGAAGGCG TGAAAAACCT CCGCGAATCC GGACTATTAA AAGTCAAACA AGGTGTGACC
TCGCTTGACG AGGTGCTCAG CACCACCAAC GCTTAA
 
Protein sequence
MAATPQNPPL SGLARALVQA GHLKEVEAEQ LLAQAHSTKT SLIEQIITSQ KSSAIDIARF 
VADTFGYPLL DLNAFDEAHI PSDAIDRKLI ATHKVIPLNK RGNRLSVAIA DPTNLRALDE
IRFQTGLAVD PIVVEHPKLA PLVNKYAETA AEALKNFTSE DLNLDFLDEE TSSKADEAAG
QEIDDAPVVK FIQKMLLDAI NDGASDIHFE PYEKFYRIRF RVDGILREVA TPPLAIKEKI
ASRIKVISRL NIAEKRVPQD GRMKLVLSKT RAIDFRVSTL PTLQGEKIVM RILDPSSATL
GIEALGYEPE QKAAIMDAIS RPYGMVLVTG PTGSGKTVSL YTCLNILNKD GINISTAEDP
AEINLPGVNQ VNVDDRAGLT FPVALKAFLR QDPDIIMVGE IRDLETAEIS IKAAQTGHLV
LSTLHTNDAP QTLTRLMNMG VPMFNIASSV LLITAQRLAR RLCNCKKPMT VPDQALLDAG
YSEADLDGSW TLFGPGGCER CKGTGYKGRV GIYQVMPISE AMQRMIMSGA SALDLGAQAK
AEGVKNLRES GLLKVKQGVT SLDEVLSTTN A