Gene Dret_1103 details

Gene Information

Locus tag: Dret_1103
Symbol:
ID: 8418928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1294039
End bp: 1295754
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 56%
IMG OID: 645037675
Product: type IV-A pilus assembly ATPase PilB
Protein accession: YP_003197969
Protein GI: 258405227
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02538] type IV-A pilus assembly ATPase PilB
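
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1294039
end_bp = 1295754
gene_length_bp = 1716
protein_length_aa = 571

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # 1716 0 571
```

Under those assumptions the span equals the listed gene length, is a whole number of codons, and gives the listed protein length.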


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.202418
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00743342
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGCCAAGC AGACGACCAC CGAGGCCCTG AAACAATGGG CCCAATTCAC TGCAGAAGAA 
ATCAGGGACA TAGAAGAACT CCAGCGTCAG AAGCGGACCA GCTTTCTTGC GGCGGCGTTC
GATAAAGATA TCCTCAGGGA CCAGGACTAT CTGGAATTTC TTTCCCAACG TCTGTCCATG
CCGTGCGCTG CGCCGGAACT TTTTGATATC GCTCAGGATA TCTTTGAACT TGTGCCTTCA
GAGTTGTGCC GCAAGTACGA GGCGGTCCCG TTCTTCCGCC ACAACAACAC CCTGTTCATC
GCCACCGCCG ACCCGGAAAA TCTCCTCGCC CTTGACGACA TCCGTTTTGT GACCGGCATG
GAACTCGCCG TGCACATCGC CACGCCCACG AGCATCGCTG TCAGTCTGGA GAACTATCTT
AAAGGCGAAG AGTCTGGCGG GAATTTTGGC GATTTGGACG AGGCCCTGGC CGACATTGCC
GAGTCCGATG TTGAAATTTC GCGCAAGAGC GAGGAATCCG CTTCGGAAGA ACCTTCGGTG
CTCGAGGCCG CTTCTCAGGC TCCTGTGGTC AAGATGGTCA ACCTGATCAT CATGGACGCC
ATCCGCAAGA AGGCGTCGGA TATCCATATT GAACCCTATG AAGAACTTTT CCGAGTCCGT
TTTCGTATCG ACGGTGTCCT GCAGGAGGTC ATGCGGCCGC CGATGCGCTT GCGCAATGCG
ATCATCTCCC GTTTGAAAAT CATGTCCCAC ATGGATATCG CCGAACGGCG ACTGCCCCAG
GATGGCCGGG TCAAGGTCCG GACCCCCGGA GGGTTGGAGG TCGAATTCCG GGTTTCGGTC
TTGCCTCTTT TGTACGGGGA AAAGGTGGTC ATGCGCCTGT TGGACAAGAG TTCGCTCAAT
CTTGACCTGC GGGATCTGGG GTTGGAAGAC AGCGCCTTGG AGATCCTCCA GCGCGCCATC
ATCAAACCGT ACGGAATGAT ACTGGTTACC GGCCCAACAG GCAGCGGAAA GACGACCACC
TTGTATTCGG CGATCATGGA ACTCAACAAG CAGGAAGTGA ATATCGCCAC TGCCGAAGAC
CCGGTGGAGT ATAGTCTGGA AGGGGTCAAC CAGGTCCAGG TCCGCGATGA TATCGGCTTG
ACATTTGCCG GGGCCTTGCG CTCCTTTTTG CGTCAGGATC CGGACATTAT TCTTGTGGGT
GAGATCCGGG ATCTGGAAAC CGCCGAAATC GCCGTCAAAG CGGCTATGAC CGGCCACCTT
GTTCTTTCCA CCCTGCACAC CAATGACGCG CCACGGACTT TGACGCGCCT GATGAACATG
GGGGTCGAAG AATATCTGAT CGCTTCGTCG GTCAATGCCA TTGTTGCCCA GCGTCTGGTG
CGCAAACTCT GCCCCTTTTG CAAACAGGAC ACCGAGCTTT CGCAACCGGT CCTGGACGCC
CTGGGCATTG ATCCCGCCAC CTGGGACGAC AGCCAGGTCT GTGCCCCGCG TGGATGCCCG
AAATGCAACA ATACCGGGTA CAAGGGGCGC ATCGGTCTGT ACGAGGTCCT TGAAGTTACT
GAAACTATGC AGGAATTGAT CCTGCAGCGC GCCAGTGTCC CCCATATTCA CGCCCTGGCC
ATAGAAGAGG GGATGTTGAC CATGCGTCAA AGCGGTATCG AAAAAATCCG CCAGGGAATC
ACTTCGGCCC AGGAAGTGCT CAAAGTAACG GCGTAA
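
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following minimal sketch shows one way to do that and to compute a GC fraction like the one reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a sample, so the printed value describes that line and not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short sample.
sample = "ATGGCCAAGC AGACGACCAC CGAGGCCCTG AAACAATGGG CCCAATTCAC TGCAGAAGAA"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 2))  # 60 0.55
```

Passing the full gene sequence through the same two functions would give the gene-wide value to compare with the record.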
 
Protein sequence
MAKQTTTEAL KQWAQFTAEE IRDIEELQRQ KRTSFLAAAF DKDILRDQDY LEFLSQRLSM 
PCAAPELFDI AQDIFELVPS ELCRKYEAVP FFRHNNTLFI ATADPENLLA LDDIRFVTGM
ELAVHIATPT SIAVSLENYL KGEESGGNFG DLDEALADIA ESDVEISRKS EESASEEPSV
LEAASQAPVV KMVNLIIMDA IRKKASDIHI EPYEELFRVR FRIDGVLQEV MRPPMRLRNA
IISRLKIMSH MDIAERRLPQ DGRVKVRTPG GLEVEFRVSV LPLLYGEKVV MRLLDKSSLN
LDLRDLGLED SALEILQRAI IKPYGMILVT GPTGSGKTTT LYSAIMELNK QEVNIATAED
PVEYSLEGVN QVQVRDDIGL TFAGALRSFL RQDPDIILVG EIRDLETAEI AVKAAMTGHL
VLSTLHTNDA PRTLTRLMNM GVEEYLIASS VNAIVAQRLV RKLCPFCKQD TELSQPVLDA
LGIDPATWDD SQVCAPRGCP KCNNTGYKGR IGLYEVLEVT ETMQELILQR ASVPHIHALA
IEEGMLTMRQ SGIEKIRQGI TSAQEVLKVT A
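
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a simplified illustration, not the database's own procedure. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in the start codons it permits, and that is not modeled here (this gene begins with ATG, so it does not matter for this record). The function name `translate` and the use of "X" for unrecognized codons are choices made for this example.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = "ATGGCCAAGC AGACGACCAC CGAGGCCCTG AAACAATGGG CCCAATTCAC TGCAGAAGAA"
last_line = "ACTTCGGCCC AGGAAGTGCT CAAAGTAACG GCGTAA"
print(translate(first_line))  # MAKQTTTEALKQWAQFTAEE
print(translate(last_line))   # TSAQEVLKVTA
```

Both fragments match the start and end of the protein sequence listed above. The last line can be translated on its own only because the preceding lines total 1680 bases, a multiple of three, so it begins on a codon boundary.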