Gene GM21_0594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0594 
Symbol 
ID8135909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp726787 
End bp728127 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content64% 
IMG OID644868211 
Productribonuclease BN 
Protein accessionYP_003020426 
Protein GI253699237 
COG category[S] Function unknown 
COG ID[COG1295] Predicted membrane protein 
TIGRFAM ID[TIGR00765] YihY family protein (not ribonuclease BN) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.0000174757 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAATA CCAGCGCAAA AAGGCCTTTT TCCACCGGGC TCATCTGGGA GTACGACCCC 
TCCGCCGTCG GCGGCTTCAA GGGAAGGATG GTCCGGCTGC TGCAGTTCTT CACCTTCACC
TTCTCCAACT TCATGGCCAA CAACTCGCTT TTGCGCGCCA CCGCCCTCTC CTTCACGACG
GTCCTGTCGC TCGTGCCGCT TCTTGCCCTG GCCTTCTCGG TACTGAAGGG GCTGGGCGCC
CAGAACCGGC TCGCTCCCCT GATCCTGAAA CAGGTGACCG CCGGCTCGGA GGAGGTGGTG
AACCGGGTGG TTTCCTACAT CGGCAACACC AACATGGGGT CGGTCGGCGC CATCGGCCTC
GCGGCGCTCA TCTTCACCGC CATCAGCATG CTGGCCAGCG TGGAGGAGGC GTTCAACGTC
GCCTGGGGGG TCCAGGAGAC CCGATCGCTG TACCGGAAAT TCAGCGACTA CCTGAGCGTC
CTCGTCAGCG CGCCGCTTTT GCTGCTTGCC GCCACCAGCA TCACCACGAC CCTGCAGAGC
AAATGGCTGA TCGGGTGGCT GTTGGAGCGG ACCTATCTGG GCGACCTCTT CCTCTTCATG
CTAGGGTTTA CCCCGTACCT GAGCGTCTGG GTGGCGATCT TTCTCCTCTA CATCTTCATC
CCCAACACCA GGGTCCGCTA TAGCTCCGCC CTGATCGGGG CGGTGCTCGC GGGGACCCTC
TGGCAGTTCG CCCAGTGGGC CTACATCCAC TTCCAGGTGG GAGCCGGCAA CTACAACGCC
ATCTACGGCA CGCTGGCGGC GCTCCCCATA CTGATGGTCT GGATCTACGT CAGCTGGATC
ATCGTCCTCT TCGGCATGGA GGTGGTGGCG GCGCACCAGA ACCGGGCTTT CTTTCGCCGG
GACATCCGGG GGAGGAGCAT CAGCCCGACC CTGCAGGAGC TGGTGGCGCT CGCCGCGCTT
AGGCACATCG GGGAGGCTTT CTATGAGGGG GCCGCGGGGT GGGGAGAGCA GCACCTGGCC
GCCAAGCTGA ACATGCCGCT GCGCATCGTG CGCGACACGC TGGAGCACCT GCGCGAGGCG
GGCTTCCTGG TCTGCGCGGG CGAGGGGGAG TGCTACTACC CGGCGCGGGA CCTGAACCGG
GTAACCATCG CCGAGGTGCT CCTGTCCTTA AGAAACCGCG GGGCCTATGC TTCCATCGCC
GGAGAGGAGC AGGCCGAGCG GATCGTGGAA ACGGTCGACG CGGCGCTGGA GAAGGCCCTG
GAGGGGCGAA CGCTGATGGA TCTGGCTGCT CCAGAAAAGA GTCCGGCCGG CGTTGACAAA
GAGGGAGCAG TTGACATATA G
 
Protein sequence
MKNTSAKRPF STGLIWEYDP SAVGGFKGRM VRLLQFFTFT FSNFMANNSL LRATALSFTT 
VLSLVPLLAL AFSVLKGLGA QNRLAPLILK QVTAGSEEVV NRVVSYIGNT NMGSVGAIGL
AALIFTAISM LASVEEAFNV AWGVQETRSL YRKFSDYLSV LVSAPLLLLA ATSITTTLQS
KWLIGWLLER TYLGDLFLFM LGFTPYLSVW VAIFLLYIFI PNTRVRYSSA LIGAVLAGTL
WQFAQWAYIH FQVGAGNYNA IYGTLAALPI LMVWIYVSWI IVLFGMEVVA AHQNRAFFRR
DIRGRSISPT LQELVALAAL RHIGEAFYEG AAGWGEQHLA AKLNMPLRIV RDTLEHLREA
GFLVCAGEGE CYYPARDLNR VTIAEVLLSL RNRGAYASIA GEEQAERIVE TVDAALEKAL
EGRTLMDLAA PEKSPAGVDK EGAVDI