Gene GM21_1631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1631 
Symbol 
ID8136962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1899559 
End bp1901265 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content61% 
IMG OID644869244 
Producttype IV-A pilus assembly ATPase PilB 
Protein accessionYP_003021444 
Protein GI253700255 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E
[TIGR02538] type IV-A pilus assembly ATPase PilB 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones171 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGTAA GCAGACTGGG TGAGCTGCTG GTCAGCAACA GCCTCATAAC CAAGGAGCAG 
TTGAAGCAGG CTCTGGCCGA ACAGAAGGCC GCCGGTGGCC AATTGCGTCT GGGTTCGATA
CTGGTCAGGG AGAACCTGAT CAGCGAGGCC GATCTCACCT CCTTCCTTTC CAAGCAATAC
GGCGTCCCCA CCATAAATCT CGCGGACTAC GAGGTGGAGC CCTCGGTGGT GAAGATCATT
CCCGCCGAGA TCGCTCACAA ATACCAGATC GTACCGGTCA ACCGGGCCGG TTCCACGCTG
ATCATAGCGA TGAGCGACCC GTCCAACATC TTCGCCATCG ACGACATCAA GTTCATGACC
GGCTACAACG TCGAGGTGGT GGTGGTGGCC GAGAGTTCCA TCAAGGCCGC CATCGACAAG
CTCTACGACC AGTCCGCGTC TTTGGCCGAC GTGATGAACG ACCTGGAGAT CGACGATCTC
GAGGTGGTGG GGGACGACGA GGAAGTGGAC GTTGGCTCCC TGGAGCGCGC CACCGAGGAC
GCGCCTGTCG TCAAGCTGGT GAACCTGATC CTCACGGACG CCATCAAGAA GAAAGCCTCC
GATATTCATA TCGAGCCCTA CGAGAAGTAC TTCCGGGTCC GTTACCGCAT CGACGGCGTG
CTTTACGAGG TGATGAAGCC ACCCTTGAAG CTGAAAAACG CCATCACCTC CCGCATCAAG
ATCATGAGCG AACTGGACAT CGCGGAAAGG CGGCTCCCCC AGGACGGCCG CATCAAGATC
AAGCTGGGGG GGGGCAAGGA CATGGACTTC CGCGTCTCGG TGCTCCCCAC CCTGTTCGGC
GAGAAGATCG TTATGCGTCT TCTGGATAAA TCGAATCTGC AGCTCGACAT GTCGAAGCTC
GGCTACGAGC ATGAGGCACT GGCGCACTTC CAGCGCGAGA TCCACAAGCC GTTCGGCATG
GTGCTGGTCA CAGGCCCTAC CGGCAGCGGC AAGACGGTCT CCCTCTATTC GGCCCTCTCC
GAACTGAACA AGGTCACCGA GAACATCTCC ACCGCCGAGG ACCCGGTCGA ATTCAACTTC
GCCGGCATCA ATCAGGTGCA GATGCACGAG GACATAGGAC TCAACTTCGC CGCGGCCCTG
CGCGCCTTCC TGCGCCAGGA CCCGGACGTC ATCATGATCG GCGAGATACG CGACTTCGAG
ACCGCTGAGA TCGGCGTGAA AGCCGCGCTC ACCGGCCACC TGGTGCTCTC CACGCTGCAC
ACCAACGACG CCCCCTCGAC CATCAACCGC CTGCTCAACA TGGGGATCGA GCCTTTCCTC
GTCGCCTCCG CCGTCAACCT CATCTCGGCG CAAAGGCTCG CGCGCCGGGT CTGCAGCGAA
TGCAAGATCG TGGAAGAGGT CCCGCACCAG GCGCTTATCG ACGCGGGGCT TCCCCGCGAG
CAGGCCGAAA GCGCCGTCTG CTACAGGGGA ACCGGCTGTC CCAAGTGCAA CGGGACCGGG
TACAAGGGGA GGGTCGGCTT CTATCAGGTC ATGCCGATGC TGGAGCCGAT ACGCGAACTG
ATATTGAACG GGGCCAACAC GGCCGAGATC AAAAGGGAAT CGATGCGGCT GGGGATCAAG
ACGATGCGCC AGTCCGGTCT CACCAAGTTG GTGGAGGGGG TCACCTCCTT CGAGGAAGTG
CTGAGGGTGA CGGTTGCGGA CGACTAA
 
Protein sequence
MHVSRLGELL VSNSLITKEQ LKQALAEQKA AGGQLRLGSI LVRENLISEA DLTSFLSKQY 
GVPTINLADY EVEPSVVKII PAEIAHKYQI VPVNRAGSTL IIAMSDPSNI FAIDDIKFMT
GYNVEVVVVA ESSIKAAIDK LYDQSASLAD VMNDLEIDDL EVVGDDEEVD VGSLERATED
APVVKLVNLI LTDAIKKKAS DIHIEPYEKY FRVRYRIDGV LYEVMKPPLK LKNAITSRIK
IMSELDIAER RLPQDGRIKI KLGGGKDMDF RVSVLPTLFG EKIVMRLLDK SNLQLDMSKL
GYEHEALAHF QREIHKPFGM VLVTGPTGSG KTVSLYSALS ELNKVTENIS TAEDPVEFNF
AGINQVQMHE DIGLNFAAAL RAFLRQDPDV IMIGEIRDFE TAEIGVKAAL TGHLVLSTLH
TNDAPSTINR LLNMGIEPFL VASAVNLISA QRLARRVCSE CKIVEEVPHQ ALIDAGLPRE
QAESAVCYRG TGCPKCNGTG YKGRVGFYQV MPMLEPIREL ILNGANTAEI KRESMRLGIK
TMRQSGLTKL VEGVTSFEEV LRVTVADD