Gene GM21_1633 details


Gene Information

Locus tag: GM21_1633
Symbol:
ID: 8136964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1902423
End bp: 1903643
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 61%
IMG OID: 644869246
Product: type II secretion system protein
Protein accession: YP_003021446
Protein GI: 253700257
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1459] Type II secretory pathway, component PulF
TIGRFAM ID:
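
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the source record: it assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1902423, 1903643)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Both values agree with the Gene Length and Protein Length fields of this record.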


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: 183
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCTAAGT TCGATTGGGA AGCAAGGAGC AAGGCCGGAA GCACCCAGAA AGGGGTGATG 
GAGGCCGGAA ACGCGGCCCA GGTCGAGTCG CAGCTCAAAA GATACGGATT TTCCAGCATC
ACTGTGAAGG AGCAGGGGAA GGGCTTCAGC ATGCAGCTCA AGCTCCCGGG TACCGGCGCG
AAGAAGATAG AGACCAAGGA TCTCGTGGTC TTCACGCGCC AGTTCGCCAC CATGATCGAC
TCCGGCCTCC CCCTGGTGCA GTGCCTGGAC ATCCTGTCGG GACAGCAGGA GAACAAGACC
TTCAAGGAGA TCCTGGTCAA GGTGAAAGAG AGCGTGGAGA GCGGCTCCAC CTTCGCCGAT
GCCCTCGCCA GACACCCCAA GGCCTTCGAC CAGCTTTACG TCAACCTGGT CGCCGCCGGC
GAGGTCGGCG GTATCCTCGA CACCATCCTT GCCCGGCTCG CAGCCTATAT CGAGAAGGCG
ATGAAGCTGA AGAAGCAGGT GAAGGGGGCT ATGGTGTACC CGATCACCAT CATGTCCATC
GCGGTCATCG TCGTCGGCGT CATCCTGGTC TTCGTCATCC CCACCTTCGC CAAGATGTTC
GCCGACTTCG GCGGCGAGCT CCCGATGCCG ACCAAGATCG TCATCGCCAT GTCCGACTTC
CTGACCAAGT ACCTCGTGGT CATTATCGCC ATTCTCTTCG GCATCAAGTG GGCCATCGGC
AAGTACTACC AGACCCCCGG CGGCAGAAAG AACATCGACC GGCTAGCGTT GCGGACCCCC
ATCGCCGGCC CGCTGATCCG GAAGGTGTCG GTAGCGAAGT TCACCCGTAC CCTGGGGACC
ATGATCAGCT CCGGCGTCCC CATCATGGAC GGCTTGGAGA TCGTGGCCAA GACGGCGGGT
AACAAGATCG TCGAAGAGGC GATCTACAAG GTGCGCCAGT CCATCTCCGA GGGGAAGACC
ATCGCCGAGC CTTTGGCCGA AAGCGGGGTC TTCCCTCCGA TGGTGGTGCA GATGATCTCC
GTCGGCGAGG CGACCGGTGC CATGGACGCC ATGCTCAACA AGATCGCGGA TTTCTACGAC
GACGAGGTCG ACGACGCGGT CGGCGCCATG ACCTCGATGA TGGAGCCGCT TTTGATGGTG
TTCCTGGGAA CCACGGTCGG CGGTCTGGTC ATCGCGATGT ACCTGCCGAT CTTCAAGCTC
GCCGGCGCGG TCGGCGGTTG A
 
Protein sequence
MPKFDWEARS KAGSTQKGVM EAGNAAQVES QLKRYGFSSI TVKEQGKGFS MQLKLPGTGA 
KKIETKDLVV FTRQFATMID SGLPLVQCLD ILSGQQENKT FKEILVKVKE SVESGSTFAD
ALARHPKAFD QLYVNLVAAG EVGGILDTIL ARLAAYIEKA MKLKKQVKGA MVYPITIMSI
AVIVVGVILV FVIPTFAKMF ADFGGELPMP TKIVIAMSDF LTKYLVVIIA ILFGIKWAIG
KYYQTPGGRK NIDRLALRTP IAGPLIRKVS VAKFTRTLGT MISSGVPIMD GLEIVAKTAG
NKIVEEAIYK VRQSISEGKT IAEPLAESGV FPPMVVQMIS VGEATGAMDA MLNKIADFYD
DEVDDAVGAM TSMMEPLLMV FLGTTVGGLV IAMYLPIFKL AGAVGG
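
The relationship between the gene sequence and the protein sequence above can be illustrated with a short script. This is a hedged sketch rather than the database's own method: it assumes the sequence contains only A, C, G and T, that whitespace between the 10-base blocks is layout only, and that the amino acid assignments of translation table 11 match the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). All names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGCCTAAGT TCGATTGGGA AGCAAGGAGC"))  # MPKFDWEARS
```

Passing the full gene sequence to translate and gc_percent would allow the listed protein sequence and GC content to be checked in the same way.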