Gene GM21_3199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3199 
Symbol 
ID8138551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3712061 
End bp3713167 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content62% 
IMG OID644870804 
Productvon Willebrand factor type A 
Protein accessionYP_003022984 
Protein GI253701795 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value0.930737 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCC TTCAGGGGCA ACCCAAGGGG CAGATACTGC TAGTCGTCGC CGCGGTGATG 
TTCTTCGGCA TCTTCCTCGC CGCCCTGGCG GTCGATGCCG GCAGGGCATA TGGGGTGAAG
GCGAAGCTCC ATGCCGCGGT GGACGCTGCG AGCTATGAGG CGGCCAAGGC CCTGGCGCAT
GGGGAAGATG AGGACGACAT GGAGGAAAAG GCGAGCGAGG CAGCGCGCGA CTACTTCAGG
GCGAACTTCC CTGCTGCCTA TTTCGGCGCG CAGTGCAGCG GACCGGAACT GGAGCTGAGC
GAGCGGAAGT CGGGAAAAAA GATGCGTGCC CTCACCGTTT CCGCCACAGC GACGCTCCCC
AACGTCTTTG CCGGGATCCT CGGCTGGGAC AGCATCGATC TTCCTGCCCA GTCGCGCGCG
GTCAGGACGG ATGCCGACGT GGTGCTGGTG CTGGAGTCCT CGGACGTGCT CAGGGAGTCT
TTTCCTCAGG TGAAGCAACG GGTGGCTAAC TTCAGCGACC GCTTCAGCCA ACACTATGAC
CGGATGTCGC TGGTGACCTT TGCCGCGGGA GCCGACCCGG TCATTTCCAT CTGTGGCGTC
TATAAGCCCG CAAAGGACCG CCCCGGAGCG GGGACCTTCA ACTGCGGCAG TGGGTATCAG
AAAAGCAATT TCGCCAAAGC CCTTCTGGAG TTGAGCCCGC AGACGACCGG CGCAGCAGCG
GCTCCCGAGG AGGCGATGAA GCAGGCACAG GCCCAGTTGG ACGGCCTGAG TAGCGACCTG
CGGGCGGAGA AGAGGGCGAT CGTGCTTTTG GCAAGCGATG TAGCCGCAAG CAACATTAAA
GAAGAGACCG CCGCTGCGGC TCGCAAGGAG CAGGTGTTCA TTTATGCGGT GGAGATAGCG
GGGTCCCTTA AGGCAAGCAC ACCTGCGGGG GGCGCTAACG GCCGGAGCGG GAGCGAAAAC
ATGAAGCTGT TCGCCAACAC CAAGGACTCC GGAGGCCACG AAAAGGGACA ACCGACCGGC
TCGTACTGCG CAGCGACCGA CCTGCAGCAG TTGGAGCTGT GCCTGGAAAA TATCGCCAAC
GGCATGACCG TGAGCATTGA GCAGTAA
 
Protein sequence
MKILQGQPKG QILLVVAAVM FFGIFLAALA VDAGRAYGVK AKLHAAVDAA SYEAAKALAH 
GEDEDDMEEK ASEAARDYFR ANFPAAYFGA QCSGPELELS ERKSGKKMRA LTVSATATLP
NVFAGILGWD SIDLPAQSRA VRTDADVVLV LESSDVLRES FPQVKQRVAN FSDRFSQHYD
RMSLVTFAAG ADPVISICGV YKPAKDRPGA GTFNCGSGYQ KSNFAKALLE LSPQTTGAAA
APEEAMKQAQ AQLDGLSSDL RAEKRAIVLL ASDVAASNIK EETAAAARKE QVFIYAVEIA
GSLKASTPAG GANGRSGSEN MKLFANTKDS GGHEKGQPTG SYCAATDLQQ LELCLENIAN
GMTVSIEQ