Gene GM21_1403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1403 
Symbol 
ID8136731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1649964 
End bp1650929 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content65% 
IMG OID644869017 
ProductTIM-barrel protein, nifR3 family 
Protein accessionYP_003021220 
Protein GI253700031 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.000126451 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAAAAA CGGTGGCCAT CGGCCCACTT GTGTTGAAGA ATCAGCTATT TCTGGCCCCG 
ATGGCGGGGA TCACCAATCT GCCGATGCGG ATCGTCTCCC GCGAGGGAGG GGCCTCGTTC
GCCTTCACCG AGATGGTCAG CGTGAACGGC CTGACCCGGG AAGGGCGCAA GAGTTTCGAT
CTGTTGAAAA GCTGCGCCGA GGACCGCCCC ATCGGGATGC AGCTTTTCGG GGACGACCCG
GAGATGCTGG CCGAGGCGGC CCGCCTGGTG GAAGATCACG GCGAGCTGAT CGACATCAAC
ATGGGGTGTC CGGTGCGCAA GGTGGTTGGA ACCGGCGCCG GCAGCGCGCT GATGAAGGAC
CCCCGAAAGG TGGGGCGAAT CGTCAGGAGT GTCAGGGCCG CGACGAGGCT CCCGCTCACC
ATAAAGATAC GGACCGGCTG GGTCTGCGGC GACGACACCT TCCTTGAGGT GGGGAGAATA
GCCCAGGAGG AGGGGTGCGA CGCCGTGACG CTGCATCCCA GAAGCCGCGC CCAGATGTTC
GAGGGGAAGG CCGACTGGTC CCGGATCGCC GAGCTGAAGA GCACGCTCAG CATTCCGGTG
ATCGGCAGCG GCGATCTTTT CAGCGCCGCC GACGTAGCGG GGATGCTGGC CGAAACCGGA
TGCGACGCCG TCATGGTGGC CCGCGGCGCC ATGGGGAACC CCTGGATCTT CCGTGAGGCG
CTCTCGCTTT TGGCCGGGGA GGAGCCTGCT CCGCCCTCGG TGCAGGAGCG GCTGGCGCTC
TCCCGCAGGC ACCTGGAGCT TTTCACCGAG TTCGCGGGAG GGCGGGTGGC GCTCATGGAG
ATGCGCAAGC ACCTCTCCTG GTACTCAAAG GGACTGCCGG GGGCGGCGCA CTTTCGCGCG
GCGGTGAACC GGATCGAAAG TGCTCCGGAG CTGATCCGGG CGATGGAGGA GTTCTTCGAT
GTCTGA
 
Protein sequence
MQKTVAIGPL VLKNQLFLAP MAGITNLPMR IVSREGGASF AFTEMVSVNG LTREGRKSFD 
LLKSCAEDRP IGMQLFGDDP EMLAEAARLV EDHGELIDIN MGCPVRKVVG TGAGSALMKD
PRKVGRIVRS VRAATRLPLT IKIRTGWVCG DDTFLEVGRI AQEEGCDAVT LHPRSRAQMF
EGKADWSRIA ELKSTLSIPV IGSGDLFSAA DVAGMLAETG CDAVMVARGA MGNPWIFREA
LSLLAGEEPA PPSVQERLAL SRRHLELFTE FAGGRVALME MRKHLSWYSK GLPGAAHFRA
AVNRIESAPE LIRAMEEFFD V