Gene Anae109_3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3024 
Symbol 
ID5374742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp3521603 
End bp3523090 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content77% 
IMG OID640844549 
Productnitrogenase cofactor biosynthesis protein NifB 
Protein accessionYP_001380205 
Protein GI153005880 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR01290] nitrogenase cofactor biosynthesis protein NifB 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTCC CGGTCCTCGA ACGCTCCGCC CCGCCAGCGC CCGCGGCCAA CTGCACGTTC 
GCGCAGGGCT GCGGCGACGC CGCGCCCACG CTTCCGGAGG ACGTGGCCGC GCTGGTCGCC
GACCACCCCT GCTACAGCGA GGAGGCGCAC CACCACTTCG CGCGCATGCA CGTCGCGGTG
GCGCCCGCCT GCAACGTGCA GTGCCACTAC TGCAACCGGA AGTACGACTG CGCCAACGAG
AGCCGACCGG GCGTCGTCTC GGAGCGGCTC CGGCCCGAGG ACGCGGTCCG CAAGGTGCTC
GCCGTCGCGG CCGAGCTCCC CGAGCTGTCG GTGGTCGGCA TCGCCGGGCC CGGGGACGCG
CTGGCCAACG CCGACGCGAC CTTCGCGACC CTCGAGGGCG TGCACCGCGC CGCGCCGGAC
CTGCGGCTCT GCGTCTCGAC GAACGGGCTC GCGCTCCCGG AGCACGCGGA GCGGCTCGCC
GCGGCCGGGG CGCGCCACGT GACGGTGACC GTGAACATGA TCGACCCCGC CGTGGGCGAG
CGGATCTACC CCTGGGTCCT CCGGGGCGGG CGCAAGGTGC GGGGCCCCGA GGCGTCGAGG
ATCCTCTCGG CGCAGCAGCT CGAGGGGATC GCCGCGCTCG CCGCGCGCGG CGTGCTCGTG
AAGGTGAACT CCGTCGTGAT CCCGGGCGTG AACGACGCGC ACCTGCCGGA GGTGACGCGG
GCCGTGCGGC GGGCGGGCGC CTTCCTCGTG AACCTCGTCC CGCTCATCTC GGCGCCGGAG
CACGGCACGC ACTACGGCCG GACCGGCCGG CGCGGTCCGA CCGCCCTGGA GCTGGAGGAG
GTCCAGCGGG CCTGCGAGCT CGACGCGCGG CTCATGCGGC ACTGCCGCCA GTGCCGGGCC
GACGCGGTGG GGCGCCTCGG GGAGGATCGC TTCACCTCGT TCACCCTGGC CTCCCTCCCG
CCGGGGCCTG CCCGCGACGC CCGGGAGGCG CGAGCAGCGC GCCGCGACGC GATCGCGCGC
GTGCGGGGAC GGGTCGCGAG CGAGCGCGAC GCCGCGCTCC AGCGTCTGGC GGCGGTGCCG
GCGGCGGTCT CGGCGAGGAT CGCGGTGGCC ACGCGCGGGG GCGGGCAGGT GGACGAGCAC
TTCGGCCAGG CGCGCGAGCT GCTCGTGTAC GACGTCAGCC GCGCCGGCGC GCGCCTCGTC
GGACGGCGCC CCGTCGAGCG CTACTGCGTC GGTGGCGAGG GGGACGAGGA CGCGCTCGAC
GGCATGCTGC GCGCGCTCGG CGGCTGCCGC GCCGTGCTGG TCTCGAAGAT CGGGCGCTGT
CCCAGCGCAC GGCTCGCCGC CGCGGGCATC GAGGCGGTGG TCGATCAGGC GTTCCGGCCG
ATCGAGACGG CCGCGCTGGC CTGGTTCGAG GGGTTCGCCG CGCGCGCGCG GGCGGGCGAG
GTGGAGGCCG CCGGGGGCGA TCCCGCTCCG CGGGCGGAGG TCGCGTGA
 
Protein sequence
MRLPVLERSA PPAPAANCTF AQGCGDAAPT LPEDVAALVA DHPCYSEEAH HHFARMHVAV 
APACNVQCHY CNRKYDCANE SRPGVVSERL RPEDAVRKVL AVAAELPELS VVGIAGPGDA
LANADATFAT LEGVHRAAPD LRLCVSTNGL ALPEHAERLA AAGARHVTVT VNMIDPAVGE
RIYPWVLRGG RKVRGPEASR ILSAQQLEGI AALAARGVLV KVNSVVIPGV NDAHLPEVTR
AVRRAGAFLV NLVPLISAPE HGTHYGRTGR RGPTALELEE VQRACELDAR LMRHCRQCRA
DAVGRLGEDR FTSFTLASLP PGPARDAREA RAARRDAIAR VRGRVASERD AALQRLAAVP
AAVSARIAVA TRGGGQVDEH FGQARELLVY DVSRAGARLV GRRPVERYCV GGEGDEDALD
GMLRALGGCR AVLVSKIGRC PSARLAAAGI EAVVDQAFRP IETAALAWFE GFAARARAGE
VEAAGGDPAP RAEVA