Gene GM21_4085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4085 
Symbol 
ID8139459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4665565 
End bp4667208 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content61% 
IMG OID644871700 
Productflagellin domain protein 
Protein accessionYP_003023858 
Protein GI253702669 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones136 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCA ATGACATTTC CCTTGTGGCC GGCATGAGGG ACAACCTGCT CGAATTGACC 
AAGACTGCCA AACTGACCAA CCGGACGCAG GAGCGCCTGG CTTCCGGCAA GCGGGTCAAT
TCCTCGTTAG ACGACCCCTC CAATTTCTTC GCCGCCATGG AATTGAAGAA CTACGCCGAC
GATCTTTCCC TGGTCCACGA CAACATTAAT AACACTATTC AGACGGTAAA GGCGGCTTCC
AATGGCATCA GCGCCGTCTC CACTCTCATT GCAGGTATGA AGAGTCTCGT GGACGCAGCG
AGGATAACGT CGGATCCCGC GGCGAAGGCA AAGTTCGCAA GGTCGTACGA TCAGCTTTAC
TCGCAGATCA ACTCGGTGGT ATCTGACAGC TCCTACAATG GCGGCAACCT GATCAGCGAC
AACGACCCGA GGCTCCTCAG CTGGAGCTCT CAGAACCTTC CGGTAAGCCT GAACCCAACC
GGCACCTCGG AGTACGTCGT GGAAGGAAAG TTCCTGGGTG CCGGCTACGC CTTCTACGAG
GACGGAGTCC CCGCCACAGG ATGGGTCCCC AATGACCTCG GGACCAAACT GGCCCCGGTT
GACGCAGATG CGACCGAGTT CCCTGTTCCC GGCAACGCCC CCACGCCCCT CGATTTCGTC
TTCATCATCG ACTCCACGGG AAGCATGGGC GGCTACATCA ACATGGTGGA AGCGAACGCC
AAGTCCTTCG TGAGCAACAT GGTAGCCCAG GGCGTCGACG GCAGGTATTC CTTCGTGAAG
TACGGAGACG TCTCCTCGGG CGACGCCGCC GTCATCGCGG CTCCCTCTTT TTTCACCGAC
CCTGACGCCT TCGCCGCCGC GATCACCGCG TCTGCCGACA GCCCTTCGGG GGGCGGAGAC
TTCCCGGAAT CCGGCCTTGA GGCGATACTT GGGGCTGTTT CCGGGCTTGC CTTCCGCGCG
GAGGCGACCA AGCGTATGGT GCTGCTGACC GACGCCACGG TGCATACCTT CGCCGATGGA
TTCTCCGCCG AGACCATCGC CGGAACCGCC GCCGCAGTCT CAGCAGCGGG AATCCAACTC
GACATCGCCA CGGTCGCCGG TGGCGTGACG CAGTTGACGC CGCTTGCCTC CGCGACCGGC
GGTACCGTCT ACGATATCAA CGATGCCTCC TTTTATACCG ACAACTTCGG CGTGACGCCC
GATCCCGCCA CCCCGACCCA TAACCTGACC GTCGTCTCCG CCGACTACGA TACCGACGCG
TTCTCCATCC GCTTCGATCC GCCGGGGGCG ACGAAGTCGA TGACGGCTGA GAAATCGGGG
TTGGGGCTGT ACCACAGTTG GGTGTATCAG AATTTCGCTA CCGAGGAGGG GCTCGCCGCC
GCGACCCAGG CCCTTGACGC CGCCGACCGC ATCCTCCGCA CCGAATCCGC CAACTTCGGC
TCCGGCTCGA CCATACTGGT GACCCGGGAC ACTCTGGCAT CCGAGATAGC CAACATCGTC
CGGACCGGCA GCGACAACCT TACCGCGGCG GATATGAACG AAGAGGCCGC CAACATGCTC
ATGCTGCAGA CCCGCCAGAG CCTTTCGACG ACCTCTTTAA GCATCGCCTC CCAGTCCTCG
CAGATCGTTT TGAAGCTTTT CTAA
 
Protein sequence
MAANDISLVA GMRDNLLELT KTAKLTNRTQ ERLASGKRVN SSLDDPSNFF AAMELKNYAD 
DLSLVHDNIN NTIQTVKAAS NGISAVSTLI AGMKSLVDAA RITSDPAAKA KFARSYDQLY
SQINSVVSDS SYNGGNLISD NDPRLLSWSS QNLPVSLNPT GTSEYVVEGK FLGAGYAFYE
DGVPATGWVP NDLGTKLAPV DADATEFPVP GNAPTPLDFV FIIDSTGSMG GYINMVEANA
KSFVSNMVAQ GVDGRYSFVK YGDVSSGDAA VIAAPSFFTD PDAFAAAITA SADSPSGGGD
FPESGLEAIL GAVSGLAFRA EATKRMVLLT DATVHTFADG FSAETIAGTA AAVSAAGIQL
DIATVAGGVT QLTPLASATG GTVYDINDAS FYTDNFGVTP DPATPTHNLT VVSADYDTDA
FSIRFDPPGA TKSMTAEKSG LGLYHSWVYQ NFATEEGLAA ATQALDAADR ILRTESANFG
SGSTILVTRD TLASEIANIV RTGSDNLTAA DMNEEAANML MLQTRQSLST TSLSIASQSS
QIVLKLF