Gene GM21_4122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4122 
Symbol 
ID8139496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4705709 
End bp4706908 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content60% 
IMG OID644871737 
Productphosphate-selective porin O and P 
Protein accessionYP_003023895 
Protein GI253702706 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value0.556284 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAATA TGCCAAAGTT ATCGACTGTC GCCGGGGCGA CCTTAGGGGT AGTCCTCATG 
GCGGGGACGG CCTTTGCCGG CCCGAAAATG GTCTTCGGCC CCAACGACGA GGGGGCGCTC
CAGATCGACT ACAAGGGGCA GTTCCAGATG ACCGTCCGGG ACATCGGGTC GGGCGAGAAC
AACGACGACA ACACCATGAA CTTCAACTTC CGCAGAAACC GTCTCGCCCT CATGGGGAAA
TACGGCGACA ACATGTCCAT CTACGTCCAG ACCGAGTACG TGGACGACGC CAACATCACC
CCGTTCGATG TGGCCGATAC CGACCAGGGT TCGGAGTTCC AGTTCCTCGA TGCGGTGATG
CGCTTCAAGA TCAACGACGC GCTGCGCGTG AACGTCGGCA AGTTCAAGTA CAACCTCTCC
CGCGAGAACC TTGAGGCATG CGAGATGCCG CTCACCCTGG ACCGCTCGCT CTTCATCCGC
GCCCCCTACA CCACGACCCG CGACACCGGT GTGGCCGTCT GGGGTAACCT CTTCGACGAC
ATGTTCCAGT ACCGCGTCGA TGCCATGGAA GGGCGCAAGG CCGTGTCCGG CGTCACCGCG
CCGGCCTCGA ACTTCAGGTA CTCAGCACGC GCTCACGTGA CGCTCCTCGA CCCGGAGAAC
GACTACGGCT ACAAGGGGAC CTATCTCGGC AAGAAGAAGG TGGCCACCAT CGGCGCCGCC
TACCAGTTCG AGCCTGAGGT CGCCTACGGC AACACGTTGA CGCAGACCGA CAAGAAGGAT
TACAAGGCCT GGACCGTCGA CGGCTTCGTC GAGTATCCGA TCGAAGGGGT GGGTACCGTC
ACCGCGTCGG CGGCCTACGA GGATGTCGAT CTGGACGACG CGTACCAGGG GGACAACCCC
GACTCACTGG TTACCGGCCT CAACGGCGAG AAGAACGGCT ATTACGTGAA GGGTGGTTAC
ATGCTCCCCA CCATGCCGCT GCAGTTCTTC GTCAGGTACG AGAGGTGGCG CTTTGCCGAG
TTGAACGGCG TCTTCGACCA GAGGATCGAC TGGTACGGCG GCGGGTTCAA CTACTACCTG
CGCAACCAGA ACCTGAAGCT CACCTTCGAG GCTAACTCTA CAGGCTTCAA CAAGGGTGGG
GGAACCGAGA CCACTGAAGA CTTCATGACC TACATAACGC AGCTGCAGCT TATCTTCTAA
 
Protein sequence
MLNMPKLSTV AGATLGVVLM AGTAFAGPKM VFGPNDEGAL QIDYKGQFQM TVRDIGSGEN 
NDDNTMNFNF RRNRLALMGK YGDNMSIYVQ TEYVDDANIT PFDVADTDQG SEFQFLDAVM
RFKINDALRV NVGKFKYNLS RENLEACEMP LTLDRSLFIR APYTTTRDTG VAVWGNLFDD
MFQYRVDAME GRKAVSGVTA PASNFRYSAR AHVTLLDPEN DYGYKGTYLG KKKVATIGAA
YQFEPEVAYG NTLTQTDKKD YKAWTVDGFV EYPIEGVGTV TASAAYEDVD LDDAYQGDNP
DSLVTGLNGE KNGYYVKGGY MLPTMPLQFF VRYERWRFAE LNGVFDQRID WYGGGFNYYL
RNQNLKLTFE ANSTGFNKGG GTETTEDFMT YITQLQLIF