Gene Lcho_1350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_1350 
Symbol 
ID6161325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp1437293 
End bp1438666 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content68% 
IMG OID641664104 
Productnitrogenase molybdenum-iron cofactor biosynthesis protein NifN 
Protein accessionYP_001790383 
Protein GI171058034 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCACG TCGTCGAATC GAAGAAGTCC TGCGCGGTCA ACCCGCTCAA GATGAGCCAG 
CCGCTCGGTG CTGCCTACGC CTTCATGGGG CTGGCGAGCT GCATGCCGGT GATGCACGGC
TCGCAGGGCT GCACCTCGTT CGGCCTGGTG CTGCTGGTGC GGCATTTCAA GGAGGCGATC
CCGCTGCAGA CCACCGCGAT GAACGAGACC ACCACCATCA TGGGTGGCTA CGACAACATC
GAGCGCGCCT TGCTCAACAT CCGCAGCCGC GCCAAGCCGC AGCTGATCGC GATCTGCTCG
ACGGGGCTGA CCGAAACCAA GGGCGACGAC GTCGAGGGCT ACCTGACGCT GGCGCGCAAG
AAGCACCCCG AGCTCGCCGA CACCGCGATC GTCTACGTCA GCACGCCCGA CTACGCCGGT
GCGTTTCAAG ATGGCTGGGC CAAGGCCGTG ACGACGCTGG TGTCGCAACT GCCGCACACC
GATCTGCCGC GCCTGCCGGA CCGCATCAAC GTGCTGCCGG GTTGCCACCT GACGCCGGGT
GACATCGAGG AGTTGCGCGA GCTGATCGAG GCTTTCGGGC TGAAGCCGAC TTTCGTGCCC
GATGTGTCGG GCTCGCTCGA CGGCCACATA CCCGACGACT GGCTGGGCAC CACGCTCGGC
GGCACGTCGC TGACGGACAT GCAGCAGCTC GGCGCCGCGG CCCACACGCT GGCGATCGGC
GAGCAGATGC GCCCGGCCGC CGAGGCGCTG CAGAAGCGCT GCGGCGTGCC GTACACGCTG
TTCGACCGGC TCACCGGCCT GACCGCCACC GACGCACTGA TCCGGCATCT GACCACGCTG
TCCGGCCGCC CCGTGCCCAC GCGCATCAAG CGCCAGCGCA GCCAGCTGGT CGACGCGATG
CTCGACGGCC ATTTCCACTT CGGCAACGTC AAGATCGCGC TCGGTGCCGA GCCCGATCTG
CTGTGGGCCA TCGGCAGCTT TTTGACCGAG ATGGGCGCCG GGCTGGCGGT GTGCGTGACC
ACGACCGCCT CGCCGCTGCT GGCACGTTTG CCGAGCAACG AGGTCATCGT CGGCGATCTC
GAGGATTTCG AGATGGCCGC CAAGGCCGCC GGCTGCGACC TGCTGATGAC GCACTCGCAT
GGCCGCCAGG CGGCCGAGCG GCTGGGCAAG CCGCTGTTTC GCCTGGGCAT CCCGACCTTC
GACCGCATCG GCAATTCGCA CAAGTGCTAC GTGGGCTATC GCGGCACGCG CAACTTCGTC
TACGAGGTCG GCAACGTGCT GATGGATCAC ACCCCGCGCC ACGGCCCCGA CGCCTGGCCG
CTCGGTGAAC AGGCGCTGGC GGCGGCGCGT GGCGAGGCGG CGGTCAGCGT CTGA
 
Protein sequence
MAHVVESKKS CAVNPLKMSQ PLGAAYAFMG LASCMPVMHG SQGCTSFGLV LLVRHFKEAI 
PLQTTAMNET TTIMGGYDNI ERALLNIRSR AKPQLIAICS TGLTETKGDD VEGYLTLARK
KHPELADTAI VYVSTPDYAG AFQDGWAKAV TTLVSQLPHT DLPRLPDRIN VLPGCHLTPG
DIEELRELIE AFGLKPTFVP DVSGSLDGHI PDDWLGTTLG GTSLTDMQQL GAAAHTLAIG
EQMRPAAEAL QKRCGVPYTL FDRLTGLTAT DALIRHLTTL SGRPVPTRIK RQRSQLVDAM
LDGHFHFGNV KIALGAEPDL LWAIGSFLTE MGAGLAVCVT TTASPLLARL PSNEVIVGDL
EDFEMAAKAA GCDLLMTHSH GRQAAERLGK PLFRLGIPTF DRIGNSHKCY VGYRGTRNFV
YEVGNVLMDH TPRHGPDAWP LGEQALAAAR GEAAVSV