Gene GM21_3071 details


Gene Information

Locus tag: GM21_3071
Symbol:
ID: 8138421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3559983
End bp: 3561122
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 64%
IMG OID: 644870675
Product: hypothetical protein
Protein accession: YP_003022857
Protein GI: 253701668
COG category:
COG ID:
TIGRFAM ID:
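
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported 1140 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3559983, End bp 3561122.
length = gene_length_bp(3559983, 3561122)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1140 bp and 379 aa, which agrees with the Gene Length and Protein Length fields.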


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.000000174945
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACCGAT ACCGCATACA TTGCCTGCCG GGCCTCCTGC TGGTGGGGGC ACTTTGCTTT 
GGCGCGCCCG CTTTGGCCGC GACGCCCACA GCCGTCGCTG CGCCGGGCGC CGCCGAGACG
ATGCGACCAA GGACCGAAGC GGAGTTCACC CCCACTTTTC GCTACGGCGA CCGTGTGCTC
ACCGAGGACA CCGTCTGGAG AGGGGTGGTG CTGGTGGAAG GGGCGGTGAC CGTGGCGCCT
CAGGCGACCC TGACCGTCGA GCCCGGGACC GTGATTCGCT TCAGGGGGGA TGACGCCTCC
GGCGCAGTGC TGGTGGTGCA GGGGAGGATG GCGGCTGCCG GAACAAAGGA ATCCCCCATC
GTTTTCACCT CCAGTTTTGC CGTACCTGCC GCAGGGGACT GGCAGGGGGT GATGCTCCTG
GGGAGCGAGA AGAGAAACGT CCTTGAGAAC TGCCGCATCG AGGCCGCGCA GACCGGACTT
GAAGCTATTT TCTCCAACCT GACGCTGAAG AACGTGCGGG CCGAGCGGAG CAAGGCCGGG
ATGAGGTTTC AGGACGCCCT GGTCGTGATG GAGGGAGGCG GGACCAGCGA TTGCGATACC
GGCCTCAACT TCTCCGAGAG CGAGGCGACC TTGCGCAACC TGAACCTGAT CGGAAACCGC
AAAGGGCTCG TCGCCCAGCG CAGTTCCATT TATCTGCAGG AGGGAAGCTT TTCCATGAAC
GGCTCCGCCT TCTCGTGCGA CAGCTGCCGG GTCAGGCTGC AGGGGGGAGG GGTGTCGGAC
AACGGCAGGG GAATCACCCT GTACGAGAGC GAAGGGTCGG TCACCGGCGT TGAGGTGGCG
CGCAACAGCG ACTACGGCAT TTCGCTCGCC ACCTCCCGGA TAAGGATCAC CGGGAACCAG
ATCACCGGCA ACGGCAACAG CGGCCTTTTG GTCTTCGATG CCTCTTCCGT CGCCTGGGAC
AACGCCATCC ATGACAACGG CTACGACCTT TACAACGCCG GCAAGGAGGA GTTCCGGGCG
CCGGGCAACT GGTGGGGGGC GGCCGGGCCG AAGATTTACG ACAACGGGGG AGCCGGGAAG
GTCCTCTCCA CCCCGCGGCT CACAGCACCG CCTGAAGCAG GTTCTAAAGA TAAACCCTAA
 
Protein sequence
MNRYRIHCLP GLLLVGALCF GAPALAATPT AVAAPGAAET MRPRTEAEFT PTFRYGDRVL 
TEDTVWRGVV LVEGAVTVAP QATLTVEPGT VIRFRGDDAS GAVLVVQGRM AAAGTKESPI
VFTSSFAVPA AGDWQGVMLL GSEKRNVLEN CRIEAAQTGL EAIFSNLTLK NVRAERSKAG
MRFQDALVVM EGGGTSDCDT GLNFSESEAT LRNLNLIGNR KGLVAQRSSI YLQEGSFSMN
GSAFSCDSCR VRLQGGGVSD NGRGITLYES EGSVTGVEVA RNSDYGISLA TSRIRITGNQ
ITGNGNSGLL VFDASSVAWD NAIHDNGYDL YNAGKEEFRA PGNWWGAAGP KIYDNGGAGK
VLSTPRLTAP PEAGSKDKP
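
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA. The sketch below is illustrative only. It assumes the sequence is written in blocks of ten bases separated by whitespace, as shown above, and it uses the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which start codons are permitted). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = clean(
    "ATGAACCGAT ACCGCATACA TTGCCTGCCG GGCCTCCTGC TGGTGGGGGC ACTTTGCTTT"
)
print(translate(first_line))
```

Translating the first 60 bases yields the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.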