Gene Bind_0476 details


Gene Information

Locus tag: Bind_0476
Symbol:
ID: 6200010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 538252
End bp: 540141
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 56%
IMG OID: 641704468
Product: nitrogenase molybdenum-cofactor biosynthesis protein NifE
Protein accession: YP_001831618
Protein GI: 182677472
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
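
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the usual 1-based, inclusive coordinate convention and an unspliced CDS whose final codon is a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 538252
END_BP = 540141


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1890, matches "Gene Length"
print(protein_length(length_bp))  # 629, matches "Protein Length"
```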


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGTG TTTCCGCCAC AATTCAAGAC GTTTTCAACG AGCCGGGCTG CGCTAAAAAT 
GCCGGCAAGT CTGAAAAGGA AAAGAAAAAG GGTTGCACGA AACAATTGCA GCCTGGTGGC
GCAGCGGGCG GTTGTGCCTT CGACGGCGCC AAGATCGCCC TGCAGCCCAT TACCGATGTG
GCCCATCTTG TTCATGGTCC GATTGCCTGC GAAGGCAATT CCTGGGACAA TCGCGGCGCT
TTCTCGTCGG GTTCCACACT CTATCGCACC GGCTTTACAA CCGATATCAA TGAAACCGAT
GTGGTGTTCG GTGGCGAGAA GCGGCTCTAC AAGAGCATCA AGGAAATCAT CGAAAAATAT
AATCCGCCGT CTGTCTTCGT CTATCAGACT TGCGTCCCGG CCATGATCGG CGACGATATC
GACGCGGTGT GCAAGGCGGC TTCGAAAAAA TTCGGCAAGC CGATTGTGCC TGTGAATTCC
CCCGGCTTTG TCGGTCCGAA GAATCTCGGT AACAAGCTCG CGGGTGAGGC TCTGCTTGCT
CACGTGATTG GCACTATCGA GCCCGAATAT ACGACGCCTT ATGATGTGAA TATCATCGGC
GAATATAATC TGGCTGGTGA AATGTGGCAG GTTGAGCCGC TCCTGAAGGA GATCGGCTTC
CGTATCATTT CCTGCATTTC CGGCGATGCG AAATATAATG AAGTCGCGCA GGCGCATCGC
GCCAAGGCGA CGATGATGGT CTGCTCCAAG GCCATGATCA ATATTACGCG CAAGCTCGAA
GATAAATACG GTATTCCCTA TTTCGAAGGC TCGTTCTACG GCATCGGCGA TATGAGCGAT
TCCATTCGGC AGCTTGCCCA ATTGGTGATC GATCAGGGCG CGCCGCCGGA ATTCATGGAT
CGCGCTGAAG CGGTGATTGC CCGTGAAGAA AAGAAGGCCT GGGAGCGCAT GGCAGCCTAT
ACGCCGCGCC TCAAGGGCAA GAAGGTTCTG CTCATCACCG GCGGTGTGAA GTCCTGGTCC
GTGGTTGCCG CCTTGCAAGA AGTCGGTCTC GAAATCGTTG GCACCAGTGT CAAGAAGTCG
ACGAAGGAAG ATAAAGAGCG CATCAAGGAA TTGATGGGTG AGGATGCTCA CGCCTTCGAC
GATATGTCGC CGCGCGAAAT GTATAAAATG CTGAAGGATG CCAAGGCGGA TATCATGTTG
TCCGGCGGCC GTTCGCAGTT CATCGCGCTG AAAGCCTCTA TGCCGTGGCT GGATATCAAC
CAGGAACGTC ATCACGCCTA TGCCGGCTAT GAAGGCATGG TTGATCTTGT GCGCGAAATC
GACAAAGCGC TTTACAATCC GATCTGGGAA CAAGTGCGCA AGATTGCGCC GTGGGAAAAT
CCCGCGGAAA GCTGGCAGGC CAAGGCTGAT GCCGAAGCCG CGCGCGAGGC CGCTGAATTG
GCTGCCAACC CGGCCAAAGC GGAGGAGAAG CGGCGCTCGA AGAAGATTTG CAAGTGCAAA
TCCGTGGATC TCGGCACGAT CGAGGACGCT ATTAAGGCGA ATAATCTGAC AACCGCCAAG
CAGGTCACCG AAATCACCCA TGCGGGTGGT GGCTGCACGG GCTGTGTCGG CACGATCGAA
GGCATCATCG AGGAGCTTCT CAAGCCGGCT GATGCTGTCG CGCTCGATCC CGCGCAGGCC
GAAGAGAAGC GTCGTGCCAA AAAAGTGTGC AACTGCAAAG AAGTCACGGT GGGCACGATT
GAAGATGCCA TTCGTGATAA AGGATTGAGC ACAGCTGCTC AGGTGACGGC CGCCACGGAA
GCGGGCAGCG GTTGTGGCAG CTGCGGCGAG ACGATCGAGG AGATCCTTGG CGAGGTTCTT
GCTTCTATTC CGCCTATCGC GGCAGAGTGA
 
Protein sequence
MSSVSATIQD VFNEPGCAKN AGKSEKEKKK GCTKQLQPGG AAGGCAFDGA KIALQPITDV 
AHLVHGPIAC EGNSWDNRGA FSSGSTLYRT GFTTDINETD VVFGGEKRLY KSIKEIIEKY
NPPSVFVYQT CVPAMIGDDI DAVCKAASKK FGKPIVPVNS PGFVGPKNLG NKLAGEALLA
HVIGTIEPEY TTPYDVNIIG EYNLAGEMWQ VEPLLKEIGF RIISCISGDA KYNEVAQAHR
AKATMMVCSK AMINITRKLE DKYGIPYFEG SFYGIGDMSD SIRQLAQLVI DQGAPPEFMD
RAEAVIAREE KKAWERMAAY TPRLKGKKVL LITGGVKSWS VVAALQEVGL EIVGTSVKKS
TKEDKERIKE LMGEDAHAFD DMSPREMYKM LKDAKADIML SGGRSQFIAL KASMPWLDIN
QERHHAYAGY EGMVDLVREI DKALYNPIWE QVRKIAPWEN PAESWQAKAD AEAAREAAEL
AANPAKAEEK RRSKKICKCK SVDLGTIEDA IKANNLTTAK QVTEITHAGG GCTGCVGTIE
GIIEELLKPA DAVALDPAQA EEKRRAKKVC NCKEVTVGTI EDAIRDKGLS TAAQVTAATE
AGSGCGSCGE TIEEILGEVL ASIPPIAAE
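
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline that generated this record: it assumes the sequence is pasted in the space-separated blocks shown above, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in the set of permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence and the last line of the gene sequence.
print(translate("ATGAGCAGTG TTTCCGCCAC AATTCAAGAC"))  # MSSVSATIQD
print(translate("GCTTCTATTC CGCCTATCGC GGCAGAGTGA"))  # ASIPPIAAE
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be compared against the stored values.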