Gene GM21_4128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4128 
Symbol 
ID8139502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4716827 
End bp4718107 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content60% 
IMG OID644871743 
Productphosphate-selective porin O and P 
Protein accessionYP_003023901 
Protein GI253702712 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3746] Phosphate-selective porin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value0.611373 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACAGC ACACGGCAGT GGCAGTGGCA GGTTTTCTGG CGGCGATGGC GGTTGGGCAC 
GACGTCCAGG CGAAGAGCCT TGAGGACATC CTGAAAGAGA AAGGTGTCAT TACCGAGGCG
GAGTACAAGG AGGCGTCGAA AGCGAAGCCT TACGATTACA AGCCGGGCAA GGGTTTCGTT
TTCACTTCGC CGGACCAGAA GTTCCAGGTG CAACTGGGGG GACAGATCCA GGTGCAATAC
GAACTCGACA ACTACGAAGC GTCCAACAAG CAGGACGTGA GCCAGTTCAA CCTGCGGCGT
GTCAAGACCC TCCTGAGCGG CTACGCCTTT ACCAAGGACC TCACCTACAA GGCGACCTAC
AACTGGGCCA ACGTGGTGAA GGACAACACC AAGGCCATGG AAGAAGTGAA CATGAAGTAC
CGCGTCGCCG ACGAACTGAG GGTCATGCTG GGGCAGGAGA AGATCCAGTA CTCCAGGCAG
TGGCTCACCT CCAACACGGC GCAGCAGTTC GTCGACGGTT CCTTCGTGAG GAACGCCTTC
ATGCAGGGGT ACGATACCGG CATCAACCTG CACGGCGATC TCTGGAAGGG GGTGGTGAAG
TACGATGCGG GGCTGTTCGG AGGCGCCGGC CAGAACACCA AGAACAAGAC CAACGACAAC
GCCTACAACT TCAGGCTGGC GTTCAATCCC CTGGGTGACA TGAAGTACGG CGAGGGCGAC
CTGGAGCACT CCGTGAAACC CCTGGTTTCC ATGGGAAGCA GCTACTACCT GAGCACGTTG
AAAAAGACCG TCTCCGGAAC CGGAACCACT GCGACCTCCG CCATCGACAA CAGCAAGTCC
AACTTCGTGA CCGACAGCAA CGGCTGGCTC GGCCAGGCGG TGAAAGGGAA GTATTTCGGG
ACTGCCGCCG CCGAGAAAAT CTCCGTGGAT TCCTGGGAAG CGGACTTCGC CTGCAAGTGG
CTGGGCGCCT CCATGCAGGG CGAGTACTTC TGGGGCAAGG CCCAGGGCGA GGCCTCGGGT
AAGGAACTGA TCGCGAAGGG GGCCTACGTG CAGGCCGGGT ACTTCGTGAT CCCGAAGCGC
CTGGAGCTTG CGCTCCGATA CGCCTGGATG GATCCCAACC GCGGGCTTGC CAACGACGCC
GTTTCCGAGA TCCAGGGAGG GGTCAACTAC TTCCTCTACG GCAACAACCT GAAGATCCAG
GGCGACGTGG GCAACCGCCA CACCTACAAG AACAAGTCCG ACGACCTGGT GGCGCGCGCC
CAGGTGCAGC TGCTCTTCTA G
 
Protein sequence
MRQHTAVAVA GFLAAMAVGH DVQAKSLEDI LKEKGVITEA EYKEASKAKP YDYKPGKGFV 
FTSPDQKFQV QLGGQIQVQY ELDNYEASNK QDVSQFNLRR VKTLLSGYAF TKDLTYKATY
NWANVVKDNT KAMEEVNMKY RVADELRVML GQEKIQYSRQ WLTSNTAQQF VDGSFVRNAF
MQGYDTGINL HGDLWKGVVK YDAGLFGGAG QNTKNKTNDN AYNFRLAFNP LGDMKYGEGD
LEHSVKPLVS MGSSYYLSTL KKTVSGTGTT ATSAIDNSKS NFVTDSNGWL GQAVKGKYFG
TAAAEKISVD SWEADFACKW LGASMQGEYF WGKAQGEASG KELIAKGAYV QAGYFVIPKR
LELALRYAWM DPNRGLANDA VSEIQGGVNY FLYGNNLKIQ GDVGNRHTYK NKSDDLVARA
QVQLLF