Gene Lcho_1372 details

Gene Information

Locus tag: Lcho_1372
Symbol:
ID: 6159990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 1454682
End bp: 1456463
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 68%
IMG OID: 641664126
Product: transcriptional regulator NifA
Protein accession: YP_001790405
Protein GI: 171058056
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID: [TIGR01817] Nif-specific regulatory protein
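
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon of the CDS is a stop codon. The function names are hypothetical and serve only as illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1454682
END_BP = 1456463
GENE_LENGTH_BP = 1782
PROTEIN_LENGTH_AA = 593


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end].

    Assumption: both coordinates are 1-based and inclusive.
    """
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions the span 1454682 to 1456463 gives 1782 bp, and 1782 / 3 - 1 gives 593 residues, which agrees with the reported gene and protein lengths.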


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAACC GGACGGTGAT GAATGAACGG TCTCACGTCG AGCTGATCAC GATCTACGAG 
ATCTGCCGCA TCCTCGGTGC CTCGCTCGAC ATCGGGCGCA CGTTTCGCGC CTCGTTGAAC
GTGCTGGCTG CGCACCTCGG CCTGTCGCGC GTGATGATCG TGATGGCGCC CGATGACCGG
GATGGCGAGG CACGGCTGCG TGTGCACAGC TCCGTCGGGC TGGATCAGGA GCAGGAGCGC
CGCGGCCAGT GGCATTACGG CGAAGGCGTG ATCGGCCACG TCTATGCCAG CGGCATGCCG
GTGGTGGTGC CCGACGTGGC GCAGGCCGCG GAGTTCATCG ATCGCACCGG GGCTTTCGGC
GCACAGGCCG ATCGCATGAT GGCGTTCGTG GTGGTGCCGC TCAAGACCGA CCGGGCCGTG
GTCGGCGTGC TGGCGGCCCA GCGCGAGGTC AGCGGCGGTG TGCGTCTGTC GGACGATCAG
CGCATCCTCA CGATGGCCGC CACGCTGATG GCGCAGGCCG CGTCGTTGCA CGGCGCGGTC
ACCGAGGAAC ACAAGCGCCT GCAGCTCGAG ACCACCCGCC TGCAGAAGGC GCTGGCGCCC
GAGCCGCGCG GGCGCTACAC GCTCGACAAC GTGGTCGGCG TGTCGCGTGG CATGCAGCAG
GTCTTCAGCG AGGTGCACCA GGCCGCGCCC GCGCGTGCCA CCGTGCTGCT GCGCGGCGAG
AGCGGCACCG GCAAGGAGGC GATCGCGCGG GCGCTGCATT ACCTGTCGCC ACGCAAGGAC
GCACCGTTCA TCAAGGTCAA TTGCGCGGCA TTGACCGAAT CACTGCTCGA GAGCGAACTG
TTCGGCCACG AGCGGGGCGC CTTCACCGGT GCGGCCGGTG ACCGCAAGGG CCGCTTCGAG
CAGGCGCACA CCGGCACGCT GTTCCTCGAC GAAGTGGGCG ACATCTCGGC ATCGTTCCAG
GCCAAGCTGC TGCGGGTGCT GCAGGAGCGC GAGTTCGAGC GCGTGGGCGG CAACCGCGCG
ATCAAGGTCG ACGTGCGGCT GATCTGCGCC ACCAACCGCG ACCTCGAGAA GATGGTCAAG
CGTGGCGAGT ACCGCGCCGA CCTCTATTAC CGCATCAACG TGGTGTCGAT CTTCCTGCCG
CCGCTGCGTG AGCGGCGCGA CGACATCCCG GCCCTGGTGT CGCACTTCGT CGACCGCTTC
AACAAGGAGA ACCGCCGCAA GCTCAGGATC GCCGGTGACG CGATGGAGGT GCTGTCGCAC
TGCTACTGGC CGGGCAATGT GCGCGAACTC GAGAACTGCA TCGAGCGCAC CGCCACCATG
GCCAATGGCG ACCTGATCCG CGGCGTGTCG TTCCCGTGCC GCCACAACCG CTGCCTGACG
CAGACGCTGC ATCACCTCGA GAAGGACGAC GCGGTCGCGC CGATCAGCAT GCCGACGTCG
CTGCACATCC CCGCACGGCT GCCGATCCGG GAGTCGCCGC GGCCGGTGCC CGGGTCGATC
CCGGCTTCGG CGGCTGCCGC ATCGGCGGAC CCCGATGACG ACTGGCTCGA CGACGAGGCC
GATGGCGGCG ACGACGTGTT GCGCATCGGC CCGACCGAAC ACCTCACCGG TACACCGCGC
CAGTTCGGCA ACGAGCCACC GGATGGCGAG CGCGAGCGCC TGATCTGGGC CATGGAGCAA
TGTGCCTGGG TGCAGGCCAA GGCGGCGCGG CTGCTGCACG TGACGCCGCG CCAGCTCGGA
TATGCGCTGC GCAAGTACAA CATCGAGGTG CACAAATTCT GA
 
Protein sequence
MLNRTVMNER SHVELITIYE ICRILGASLD IGRTFRASLN VLAAHLGLSR VMIVMAPDDR 
DGEARLRVHS SVGLDQEQER RGQWHYGEGV IGHVYASGMP VVVPDVAQAA EFIDRTGAFG
AQADRMMAFV VVPLKTDRAV VGVLAAQREV SGGVRLSDDQ RILTMAATLM AQAASLHGAV
TEEHKRLQLE TTRLQKALAP EPRGRYTLDN VVGVSRGMQQ VFSEVHQAAP ARATVLLRGE
SGTGKEAIAR ALHYLSPRKD APFIKVNCAA LTESLLESEL FGHERGAFTG AAGDRKGRFE
QAHTGTLFLD EVGDISASFQ AKLLRVLQER EFERVGGNRA IKVDVRLICA TNRDLEKMVK
RGEYRADLYY RINVVSIFLP PLRERRDDIP ALVSHFVDRF NKENRRKLRI AGDAMEVLSH
CYWPGNVREL ENCIERTATM ANGDLIRGVS FPCRHNRCLT QTLHHLEKDD AVAPISMPTS
LHIPARLPIR ESPRPVPGSI PASAAAASAD PDDDWLDDEA DGGDDVLRIG PTEHLTGTPR
QFGNEPPDGE RERLIWAMEQ CAWVQAKAAR LLHVTPRQLG YALRKYNIEV HKF
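
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It is not the database's own method. It assumes the sequences are laid out in whitespace-separated blocks of 10 characters, as shown here, and it uses the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons, which this sketch does not handle). The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, and only the first and last lines of the gene sequence are embedded to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First and last lines of the gene sequence, copied from this record.
# Both start on a codon boundary because each full line holds 60 bases.
FIRST_LINE = "ATGCTGAACC GGACGGTGAT GAATGAACGG TCTCACGTCG AGCTGATCAC GATCTACGAG"
LAST_LINE = "TATGCGCTGC GCAAGTACAA CATCGAGGTG CACAAATTCT GA"


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    print(translate(FIRST_LINE))  # compare with the start of the protein
    print(translate(LAST_LINE))   # compare with the end of the protein
```

Translating the first line yields MLNRTVMNERSHVELITIYE and translating the last line yields YALRKYNIEVHKF, matching the beginning and end of the protein sequence. Applying `gc_fraction` to the complete gene sequence gives a value that can be compared with the GC content field.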