Gene GM21_2679 details

Gene Information

Locus tag: GM21_2679
Symbol:
ID: 8138021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3113445
End bp: 3118328
Gene Length: 4884 bp
Protein Length: 1627 aa
Translation table: 11
GC content: 60%
IMG OID: 644870283
Product: Tfp pilus assembly protein tip-associated adhesin PilY1-like protein
Protein accession: YP_003022473
Protein GI: 253701284
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
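
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields.
START_BP, END_BP = 3113445, 3118328

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 4884 1627
```

Under these assumptions the computed values agree with the reported Gene Length (4884 bp) and Protein Length (1627 aa).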


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0000064285
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGACGCT TACTGGTCAT GCTTTTCACC GTTATCACCC TCGCCTGGTC GGCATCTGTC 
GCACAGGCAG ACCCCGCAGT AGTCACCACC AGCCCCGTCA ATGGTGCCGT CAACGTGGAT
CCCACAAGCC TTGACACAAT ATGGGTCGGG TTCAACGACT ACGACATGGA CAAGAGCAGC
TTCACGGGAA GCAACGCCGG GAGGGTTTCC GTCAACAACG GGGCCACCTA CACCGTGAGC
AAGGTGGACG GCAGCTTGCC CTGGAACGGC GATTTCATCA AGATCTCGCT GAGCAAAGAC
CTCAATTACA GCACCACCTA CACCGTCATC ATCGACTACC GAGTCAAAAA CAAGGAGGGT
GAAAGGCTTG GTTCGAGCAG CGCCTCCAAC TACGTTTTCA GCTTCACAAC CAAAGCGAAA
CCAAGCTCCG ACACGACTGC GCCGGTGGTA GATTCCACCT TTCCCGTCAG CAGTGCCGTC
GACGTTCCCA TCACCGCGGC GATAACTGTG ACCTTCGATG AGGTGATGAA AGCCGATACC
ATCAACAGCA CCAATTTCCT CATCAACAAC GGCGCGGTCG CCGGGGCAGT CACCCTCGAC
GCCAGCGGTA AGGTCGCCAC CTTTACCCCG AGCGCCAACC TGAGCTCGTT CACCACCTAC
ACCGCCACCG TCACCACAGG CGTACGCGAT GCGGCAGATA ACGCGCTTGC CAGCAACTAC
TCCTGGAACT TCCGCACCAT GGCGCTCGAC AGTGTGCGCC CAACCGTCAC CGTCGTCAGC
CCCATCGCCG GCGCCACCAG CGTCGATACC ACCACAGTGA TAACTGCGAC CTTCAGCGAG
GCGATGAGAG ACACCACCAT CACCAGCGCC AACGTCTCGG TCAGCGGCGG GGTGACCGGC
ACGGTGAGTT ACGATCCGGC GACCTGGACC CTCTCGTTCG CGCCGACCGC GGCTCTCGCC
AATTCGACAA ACTACACGGT GACCATTTCC ACCGGCGTCA CCGACCTCGC CGGCAACTCC
CTGTTAACGG CGAAGAGCTG GACCTTTACC ACCAGGGCCG TCACGACGCC GCCGCCTTTG
AACGATTATT GCCAGGTCCC TCCCTACGTC ACAAGCACCA ACAACATGGT GAAGCCCAAC
GTCCTGCTCG TGGTCGACAA CTCGGGGAGC ATGTACGAGT TCGCCTACAA GAACAGCGGC
GCAGGCAACA ACTCGTACGA CACCAGCTAC ACCCCCGCGA AGAGTTACTA CGGGTATTTT
GACGACAAAA AGATGTACCT CTACTCAGGG GGGGCATTCG TTCCCGACAC CGCCGCCTCG
ACGGTCACAG ACACCACTAA ATTTTTATCC GGCAACTTCC TCAACTGGCT CACCATGAGG
CGAGTCGACG TGGTCCGTAA GGTACTTGTG GGTGGCAAGA TGACCCCGCG CGTCGGCGCC
GGCCGGTACC TCTATCCTGC CGGCTCCCCC GATCGCGACT TCTACAAGAG CTACAACAAC
GTGAAGTACA CCGTCCAGGG GGGCGCCTCC ACCGAGGTGA TCAAGGACAC CACCAACAAC
GTGACCTACA ACCTGAAAAT CGCGATTGGT GACGAGGGGA GCAATCAGGA CGAAGGGCTG
GTGCCGAAGT ACGCCAACAT GATCAACTTC GGCATCATGT TCTACAACGA GGGCTACAAG
TACGAAAACT CGGTCAACAA CGTGCGCGAC GGCGGGTACG TCGCCGCCGA CCTGGGCAGC
ACCGGAAGCA ACCTGGTCAC GCAGATCGAG AGCACCGATC CCACCACCTG GACCCCCCTC
GGCGAAACCC TTTTCGAGGC GACCAGGTAC TTCCAGGCAG GATCCAGTGC CTATAACGGC
GGGACCTACT CCGGCAAAGA CCCCATCAAC TACGCCTGCC AGAAAAACTT CGTGCTGATC
CTGACCGACG GGGAATCGAC CAAAGACGAG AACATCCCGG GCGGCTCGAC CAACTTTTCC
GGAAAGGTAA CCGACAGCAG CTTCAACGTG AAGACCTGGA TGGACAGCAT CGCGACTCAG
GAAGGGTATG CCAGCCAGTA CAGCTCCAGT GCCAATACCA GCGAAGGGAC CTACTACTTG
GAGGGGGTAG CCTACTGGTC CCACGTGACC GACATGCGTT CGGCCTCCCT CGGCGACAGC
GACATCCCCG GCAAGCAGAA CCTCACCATC TACACGGTCT TCGCCTTCGA CGACTCGCCC
GTAGGCCGGG ATCTTCTGAA AAAAACCGCC AAGTACGGCG GATTCAACGA CTACGACTCG
ACCGGAAAGC CGGACAAGGT CGCCAAGTGG GACCAGGACG GCAACGGGAT CCCTGACACC
TATTACGAGG CCTCCGACGG CGCTGCACTC GCGGCCTCGC TGCAGAAGGC GTTCAACGAC
ATCCTGGCGC GCGTCTCCTC GGGTACCGCC GCCTCCATTC TCAGTAACAG CGAGGGGAGC
GGGGCGAACA TACTGCAGGC CGTGTTCCAC CCGAGGAAAT ACTTCGATGC CCAGACCTCC
GCCGACTGGA TCGGGGAGAT GCACAACATG TGGTACTTCG TCGATCCCAA GATCAAAAAC
AGCTCCATCC GCGAGGATAG CGACTACATC CCCGGGAGCC CTGCGCCGCC CCACTACCTG
AACCTCAGTA AAGACAAGTT GATCAACTTC TACTTCAACA CCGACCAGGC CAAGACGATG
GTCAAACGTT ACACCGACGT CAGGGGGGAC GGCCAACCTG ACCTCGACAC CAACGGCGAT
CTCAAGGCGG ATTCCTACAC CCCGTACGAC GAGGTCGACT CCGACAGCGT AAAGAGCATC
TGGAAGGCCG GAAAGCAGCT CTGGAGCCGC ACCGCCGCAA GGAATATCTA CACGAATCTC
GCCGGGAGCC TCACCAGCTT CACCGGTCTG GACACCACCG ACGGCAATAT CCAGCAACTC
CTGCAGGCGG CGAACAAGAC CGAAGCGGAC AAGATAATCT CCTACATCGC CGGCACCGAC
CAAAGCGGGT ACCGCAACCG GACCGTCAAC ATCGGCGGCG TCACCGGCAC CTGGCGACTG
GGGGATATCG TCTCCTCCAC GCCAAGGCTG CAGTCGTCGG TCAAGCAGAA CGTCTACAAC
ATGCTTTCGC CGAAGGGTTA CGGCGACAGG TCTTACGGGG ACGACTACAC CAGGAAGGGG
TACATCTACA CGTCTTCCTA CACCAACCGC GGCATGGTGT ACGTGGGCGC CAACGACGGG
ATGCTGCACG CCTTCAAGCT GGGTCTTCTC GATGTCACTG CCTCCGGAGA CCGTAAAGGG
AAGCTCTCCG GAGAAGATCT GGGTGAGGAG CAGTGGAGCT TCATTCCGAA GAACGCGCTG
CCGTACCTGA GGTACAACGC CGACCGGGAC TACAACCATA TTTACTACGT GGACGGCACC
ACCGTGATCA ACGACGTCAG CATGGGAACA CCGGCGGGCT GCACCGACAA CTACTGGGAC
TGCACCAGGG ACGTTGAGAA CGGGAGCAAC TGGCGCACCG TCCTCGTCTC GAGCATGGGG
CTCGGAGGTG CGTCGAAGAT CGCAGGCTCC GGCTGCAAGG GGACCACCTG CGTGGAAACC
CCGATCACTG ACCCCGCCAA CGCCGGCGAG GGGGTGGGTT ACTCATCGTA CTTCGCCCTC
GACATAACCA ATCCGGACAG CCCTTCGCTT TTATGGGAGT TCGGCAAGCC GAACCTCGGC
TTCTCGACCA ACGGCGCCGC CTTCGTTAAG ATCAGCGCCA AGAAGGCGGA CGGGGTTACC
CCGGACCTCA CCAAAAACGG GAAGTGGTTT GCGGTCCTCG CCTCCGGGCC TACCGGCCCC
ATCGACACCG CGCTGCACCG GTTCAAGGCG AGTAGCAACC AGAACCTGAC CATCTACGTC
CTGAACCTCG CGGACGGGTC TATCGCAGCC ACCATCGACA CGCTGGCCGA CGGGACCAAG
CTAAGCAACG CCTTTGCCGG GACCATCACC AACGCGACGG TCGACACCGA CCGCTGGAAC
AGGAACGCAC GTGGGCACTA TGAGGACGAC GCGCTCTACA TCGGCTACAG CCAGCTTTCC
GGCACCGACT GGACCGGAGG CGTCTTGCGT CTGATGACCG GGGAGAACCT CGACCCGACA
ACCTGGAAGC TCAGCAAGGT GATCGACGGG GTAGGGCCGG TGACCACCAA CTTGACCAGG
CTGCAGGACC GTCAGAAGCA CAACCTCTGG CTCTACTTCG GCTCGGGCAG GTACTACTAC
AGTCAGGACG ACAACGACGC CACCAGGCTC ATCGCTGCAG TGAAGGATCC TTGTTACGAC
TCCGCCAATG ACAAGCTCAT TCCGACCTGC ACCACCAAGA AGAGTCTCTC CGACCTGACC
AACCAGTCCA CCTCCCGCAG CGCTGTGACC AATGACGGAT GGTACATCAA CCTCGCCCCT
GTCGACGCCA CCCATTCGTT CGGCGCGGAG AGGATGATCG CAGACCCGTC CGCGCTGGCC
AACGGGGTGG TCTATTACAC CACCTTCGCT CCCACCACGG ACCCCTGCAG CTTCGGGGGA
AAATCGTACA TGTACGGGCT GGATTACGAT AGCGGCAACG CTCCGGCGTG CAATCAGCTC
GGAGAGGGCA TAGCGCTGGT GCAACTCTCC ACGGGCGCCT TCCAGGAGAT ACACCTGAAG
GATGCTATAG GCTGCTACAA CAACAAGGGC GAGTTTGTTC CCCCGCTTCC CCCGACGGAT
ACACAGGTGA ACCCGGCGGG GTACACGCCT CCCCCCGGGA AAAGCTACTC CATGGTCGGC
AAACCGCCGG GCGACGCTTC GCCGATTGTT TCCGCCTCTG GCTTAGTACC GGCGAAACGG
ATCTTGCATA TCCTGGAGCA TTGA
 
Protein sequence
MRRLLVMLFT VITLAWSASV AQADPAVVTT SPVNGAVNVD PTSLDTIWVG FNDYDMDKSS 
FTGSNAGRVS VNNGATYTVS KVDGSLPWNG DFIKISLSKD LNYSTTYTVI IDYRVKNKEG
ERLGSSSASN YVFSFTTKAK PSSDTTAPVV DSTFPVSSAV DVPITAAITV TFDEVMKADT
INSTNFLINN GAVAGAVTLD ASGKVATFTP SANLSSFTTY TATVTTGVRD AADNALASNY
SWNFRTMALD SVRPTVTVVS PIAGATSVDT TTVITATFSE AMRDTTITSA NVSVSGGVTG
TVSYDPATWT LSFAPTAALA NSTNYTVTIS TGVTDLAGNS LLTAKSWTFT TRAVTTPPPL
NDYCQVPPYV TSTNNMVKPN VLLVVDNSGS MYEFAYKNSG AGNNSYDTSY TPAKSYYGYF
DDKKMYLYSG GAFVPDTAAS TVTDTTKFLS GNFLNWLTMR RVDVVRKVLV GGKMTPRVGA
GRYLYPAGSP DRDFYKSYNN VKYTVQGGAS TEVIKDTTNN VTYNLKIAIG DEGSNQDEGL
VPKYANMINF GIMFYNEGYK YENSVNNVRD GGYVAADLGS TGSNLVTQIE STDPTTWTPL
GETLFEATRY FQAGSSAYNG GTYSGKDPIN YACQKNFVLI LTDGESTKDE NIPGGSTNFS
GKVTDSSFNV KTWMDSIATQ EGYASQYSSS ANTSEGTYYL EGVAYWSHVT DMRSASLGDS
DIPGKQNLTI YTVFAFDDSP VGRDLLKKTA KYGGFNDYDS TGKPDKVAKW DQDGNGIPDT
YYEASDGAAL AASLQKAFND ILARVSSGTA ASILSNSEGS GANILQAVFH PRKYFDAQTS
ADWIGEMHNM WYFVDPKIKN SSIREDSDYI PGSPAPPHYL NLSKDKLINF YFNTDQAKTM
VKRYTDVRGD GQPDLDTNGD LKADSYTPYD EVDSDSVKSI WKAGKQLWSR TAARNIYTNL
AGSLTSFTGL DTTDGNIQQL LQAANKTEAD KIISYIAGTD QSGYRNRTVN IGGVTGTWRL
GDIVSSTPRL QSSVKQNVYN MLSPKGYGDR SYGDDYTRKG YIYTSSYTNR GMVYVGANDG
MLHAFKLGLL DVTASGDRKG KLSGEDLGEE QWSFIPKNAL PYLRYNADRD YNHIYYVDGT
TVINDVSMGT PAGCTDNYWD CTRDVENGSN WRTVLVSSMG LGGASKIAGS GCKGTTCVET
PITDPANAGE GVGYSSYFAL DITNPDSPSL LWEFGKPNLG FSTNGAAFVK ISAKKADGVT
PDLTKNGKWF AVLASGPTGP IDTALHRFKA SSNQNLTIYV LNLADGSIAA TIDTLADGTK
LSNAFAGTIT NATVDTDRWN RNARGHYEDD ALYIGYSQLS GTDWTGGVLR LMTGENLDPT
TWKLSKVIDG VGPVTTNLTR LQDRQKHNLW LYFGSGRYYY SQDDNDATRL IAAVKDPCYD
SANDKLIPTC TTKKSLSDLT NQSTSRSAVT NDGWYINLAP VDATHSFGAE RMIADPSALA
NGVVYYTTFA PTTDPCSFGG KSYMYGLDYD SGNAPACNQL GEGIALVQLS TGAFQEIHLK
DAIGCYNNKG EFVPPLPPTD TQVNPAGYTP PPGKSYSMVG KPPGDASPIV SASGLVPAKR
ILHILEH
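
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the database's own tooling; the names `CODON_TABLE`, `clean` and `translate` are illustrative. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in which codons may act as start codons), so the standard assignments are used here. The example translates only the first and last rows of the gene sequence.

```python
# Standard codon assignments in TCAG order, as used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO_ACIDS
    )
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the gene sequence shown above.
first_row = "ATGAGACGCT TACTGGTCAT GCTTTTCACC GTTATCACCC TCGCCTGGTC GGCATCTGTC"
last_row = "ATCTTGCATA TCCTGGAGCA TTGA"

print(translate(first_row))  # prints: MRRLLVMLFTVITLAWSASV
print(translate(last_row))   # prints: ILHILEH
```

Both outputs match the corresponding ends of the protein sequence. The last row can be translated on its own only because the 4860 bases preceding it are a multiple of three, so it starts in frame.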