Gene GM21_2665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2665 
Symbol 
ID8138007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3101662 
End bp3104442 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content62% 
IMG OID644870269 
Producttype IV pilus secretin PilQ 
Protein accessionYP_003022459 
Protein GI253701270 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4796] Type II secretory pathway, component HofQ 
TIGRFAM ID[TIGR02515] type IV pilus secretin (or competence protein) PilQ 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.000377004 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAAGGT ACCAGTGCAG AATTCTATGT CATGCCGTGG CACTCATCGT TTTGTTGACC 
GTTTCAGCAG GATGCGTGAA ACGGATGACC GCTGCGAACG AGTCTGCCCA GGCAGAGGCC
TCCGCCTTTG CGACCCTCAG GTCGGTCACG GTGTCGGCCG ACGCCACCAG CGTGGAGCTG
GTCAGCGACA AGCCGATCAC CTATACCTCC TACAAGGGGG GGGATCCGAC CCAGATCATC
GTCGACATCT CCCAAACCGA GCCGGGGGCT GTGACTTCTC CCATCGAGGT GAACCGCGGC
AACATCAAGC GGATCGAGGT CGAGCGCCAA CCTTTGGGGG GGAGCGTGCT GACGCATATG
ACGCTCGTTC TCACTAAGGA CGTCGACTTC GCGGTCGCCA CCGACCCCTC GGACAAGAGC
AGGCTCAAGA TCTACCTTCC GGTAGTGGAG CCCGAGGCGA AGGCTGAGCC GACCGAAGTC
AAGGAAGCCG CGCTTGCCGA GAGCAGGATC GACGAGAAGA CCCTGACGCC GGCTGCTGCT
ACGCCCGCCG TTGAGGCAGG CTCCGCTCCC ATCAAGGCAG AAGCCGTGCC GGCACCGCCC
GCCGCACCTG CCACGGAGTC TAAGGCAGCA ACCGCAGCTG TCGCAGAACC GAAAGCGGAT
GCCGCACCTT GGAAGGCTGG CGGCGCAGAA GGGGGGAACG GTCTCAATGC GGTCATCCCA
GGGGCGGACG GTGTGGATCT CTCCATCCAA GGGGGGGTAC AGACCTTCAA CTCCTTCAAG
TTGACCAAAC CTGACCGCAT CGTCCTCGAC CTCTTCAAGG TCAAGAACTC GCTGCCCCGG
AACGTGGTGC CTGTCAACGC CTTTGGCATC GCCAACGCCC GTGTGGGGTC GACTCCCGAC
AAGGTGCGAG TGGTGTTGGA CGCAGCGGGG GAGAGCCTTC CCCCGTACGA AGTGGTCAAG
AGCGACCTCG GGGTGAAGAT CAGGCTCAAG GGAAAGGCCA CGGCCATCGC CAAGGCTCCG
GCTGTTGAGG CCCCGGCAGC GGCGCCCGCC CCGGTTCCTG CGCCTCCCGC AGCGAAACTG
AAGACCGAGG TTCCCCACTC CCGCCAGACC AAGGGGGCGC TGGAAGGAAT CGAATTCAAG
GTGGTCGGCG GCGTCTCCCG CGTTTCCATG AAACTCTTCG GGACCTGCGA ACCCGGGCAG
CCGTTGCGCG GCCCCCAGGG GGTAACGCTC GGCATCGCCA ACTGCCAGGT ACCCAAGCAA
CTGCAGCGCG CGCTCGACAC CACCAAATTC GGCACGCCGG TGCTCTCGGT AACGCCTTAC
CAGGTGAAGG TCAAAGGGAG CACGGTTACC AAAGTGTTGG TGAAGTTACG CGGCAACCCG
GAATTCAGCA CCAGCCGCAA GGGAGACCTG CTCCTTTGGG ATTTCGTAAA CCCGGGTCCT
GTAGCTGCTC CCAAACTCCC TGCCGCACCG CCTGCTCCCA AGGCCAAGGC TCCTGTCGAG
CCCAGGGTCG CCGAGGAACT AAGCCCGCCG CCTGCGCGCA GTAGCGACGA CATGGCGGTT
CAGCTTCCCA GCGACCGCCC GGCCAAGAAG GTCTACACGG GGAGGAGGGT GACCCTCGAG
TTCTCCGACG CCGATGTCAG GAAGATCTTC CAACTGATCG CCGAGGTGAG CAACCTAAAC
TTCCTCATCG CCGATGACGT CACCGGGACC ATCAGCATAA AGCTGGTGAA CGTCCCCTGG
GACCAGGCGC TCGACGTGAT TCTGGATGCC AAGGGGCTCG CCATGATACG CGAGGGGAAC
ATCGTTCAGA TCAAGCCCAG GTCCAAGATG CAGAACCAGG CCGACGAGGA ACTCGCGGCT
AAGAAAGCGG CTGAGCGCCT GATGGAATTG AGGACCACGG TGTTCGAAGT GAACTACGCC
TCGGTGTCCG ACGTGGCGAC CCAGTTCGCC ATGCTAAAGA GCGATCGCGG CGTAATCACC
AAGGACGAGC GCACCAGCCG CGTGATCGTG AAGGATATCC AGACCGCGCT GGACGACATG
AAGGCGCTCC TGAAAACTCT GGACGCCCCC GAGAAGCAGG TGATGATCGA GGCCCGCATC
GTCGAGGCGA CCTCGACCTT CACCCGCGAC CTCGGCGTGC AGTGGGGGCT CAGCTACCGC
GACGGCTCCG CTTCCATGGC TGGCATCAAC TCAGTCGACA CTGGTTTCGG CGGCGTCGTT
TCCGCGGCAG GGCCGGGTGC AACCGGCGCT GGTGGGCTCG GACTCGGCAT GTCCTTCGGC
AAGCTGACCA GCAACATCAA GCTGGACATG AGGCTCGCGG CGGCCGCGAC CATCGGGCAG
GTGAAGATCA TCTCCACCCC GAAAGTGGTC ACCCTGAACA ACAAGGCGGC CAAGATCTCA
CAAGGGCAGT CCATACCGTA CCAGACCACT TCCGCCGAAG GGACCAAGAC CGAGTTCGTG
GAAGCGGCGC TGACCCTTGA GGTGACCCCG CACATAACCG CAGACGGCTC GGTCAGCATG
AAGATCAAGG CCAGCAACAA CTCGCCTGGC ACCGGGTCGC CCCCTCCTAT CAACAAGAAG
GAGGCGACCA CCGAGCTCGT GGTCTCCAAC GGCGACACCA CCGTCATCGG CGGCATCTAC
GTCGACAGCG AGACCGAATC CGATACCGGC GTCCCGTTCC TCTCGGACAT TCCCCTTTTG
GGTTGGCTCT TCAAATCCAA CGCGAAGCAG AAGACGAAGA CGGAACTGCT CATTTTTATT
ACGCCAAAAA TAATTTTATA G
 
Protein sequence
MTRYQCRILC HAVALIVLLT VSAGCVKRMT AANESAQAEA SAFATLRSVT VSADATSVEL 
VSDKPITYTS YKGGDPTQII VDISQTEPGA VTSPIEVNRG NIKRIEVERQ PLGGSVLTHM
TLVLTKDVDF AVATDPSDKS RLKIYLPVVE PEAKAEPTEV KEAALAESRI DEKTLTPAAA
TPAVEAGSAP IKAEAVPAPP AAPATESKAA TAAVAEPKAD AAPWKAGGAE GGNGLNAVIP
GADGVDLSIQ GGVQTFNSFK LTKPDRIVLD LFKVKNSLPR NVVPVNAFGI ANARVGSTPD
KVRVVLDAAG ESLPPYEVVK SDLGVKIRLK GKATAIAKAP AVEAPAAAPA PVPAPPAAKL
KTEVPHSRQT KGALEGIEFK VVGGVSRVSM KLFGTCEPGQ PLRGPQGVTL GIANCQVPKQ
LQRALDTTKF GTPVLSVTPY QVKVKGSTVT KVLVKLRGNP EFSTSRKGDL LLWDFVNPGP
VAAPKLPAAP PAPKAKAPVE PRVAEELSPP PARSSDDMAV QLPSDRPAKK VYTGRRVTLE
FSDADVRKIF QLIAEVSNLN FLIADDVTGT ISIKLVNVPW DQALDVILDA KGLAMIREGN
IVQIKPRSKM QNQADEELAA KKAAERLMEL RTTVFEVNYA SVSDVATQFA MLKSDRGVIT
KDERTSRVIV KDIQTALDDM KALLKTLDAP EKQVMIEARI VEATSTFTRD LGVQWGLSYR
DGSASMAGIN SVDTGFGGVV SAAGPGATGA GGLGLGMSFG KLTSNIKLDM RLAAAATIGQ
VKIISTPKVV TLNNKAAKIS QGQSIPYQTT SAEGTKTEFV EAALTLEVTP HITADGSVSM
KIKASNNSPG TGSPPPINKK EATTELVVSN GDTTVIGGIY VDSETESDTG VPFLSDIPLL
GWLFKSNAKQ KTKTELLIFI TPKIIL