Gene Clim_0689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0689 
Symbol 
ID6354303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp762561 
End bp764195 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content50% 
IMG OID642668316 
Producttranscriptional regulator, NifA subfamily, Fis Family 
Protein accessionYP_001942751 
Protein GI189346222 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID[TIGR01817] Nif-specific regulatory protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.430174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCATTC CCCAGAAAAA AAAAGACAGC AGCATCAGCC TTCTGGCTGA AGTCAGCAGA 
ACTGTAACGA TTGAAAAAGA TATCAGCAAG GTGCTCCGCC TGGTACTTTT CATCATGTCG
GAGCATATGG ATATGCTTCG CGGAATGATC ACCATTCTCA ACCGCGATAA TGACGAAATA
GTCATCAATG AATCATTCGG ACTGAGCGAA GAAGAAAAAG AACGTGGACG CTACCGGATA
GGAGAGGGCA TTATCGGTCA GGTCGTAAAA ACCGGTAAAC CGGTTCTGGT ACCAAATATC
AATGATGAAC CATTGTTCCT TGACCGTACC CGTTCCCGTC AGAAGGAGAG AACCGACGAC
CTTTGTTTCA TCTGCATTCC CATAAAGACG GGAACCGAGA TCATCGGAAC CCTCAGCGCC
GATCGTCAGA TTGAACCGCC ATTTCCCGAA GACCCGTCGA AACGGGCCAA AGCGGAAAGC
GAACGGATGG ACATGATGCA GCACTACGTC GACCTGCTTT CCATTATCGC GTCCATGATT
TCTCAGGCGG TAAGGCTCAA ACAGCTTGCT CACGAGGAGA ACTCGAATGG AACAGGCACA
ACGCACTCGC TGAAAGGGAA AAATCTGCTC ATCCCTCACC GGGACAATGA CAGCCATGAG
GAAGAGGTGG ATGAAACGGA ACGCCCGGCA AACATTATCG GCAATGCAAA ACCGATGATG
TCATTGTTCA AAATGATCGA CAAAATCGCA AAAACCAGTG CGACAACTCT GGTGCTGGGC
GAAAGCGGTG TAGGCAAAGA ACTCGTCGCC AGCGCCATTC ACTTTAAAAG CCGTCGCTCC
GACAAGCCGT TTATCAAATT CAATTGTGCA GCCCTACCGG AAAGCATTGT AGAAAGCGAG
TTGTTCGGCC ATGAAAAAGG CTCTTTTACC GGAGCCTCGG GTATGCGTCA GGGACGGTTC
GAGCTGGCCC ATACCGGCAC GATATTTCTT GATGAGATCG GAGAACTCAG CTTGCCGGTA
CAGGCGAAAC TGCTTCGCAT CCTTCAGGAA AAAGAGTTCG AACGGGTTGG CGGCTCGAAA
ACCATCAAAG TCGATGTCAG AGTTATTGCC GCAACCAACA GGAACCTGGA AAACCTCATC
CGTGAAGGAC AGTTCAGGGA AGATCTGTTC TATCGGCTGA ATATTTTTCC GTTGACCGTA
CCGCCGCTCA GGGAGAGAAA AACCGATATA CTGCTGCTCG CAGATTACTT CGTCGAAAAA
TATAACCGGA TCAACCAGAA AGGAATCCGC CGAATTTCAA CGACATCGAT AGACATGCTG
ATGCGCTACC ACTGGCCCGG CAATGTGCGT GAACTGGAAA ACTGCATGGA ACGAGCGGTC
ATTCTCAGCG AAGATAACGT CATTCACGGC TATCACCTTC CGCCAAGCCT GCAGACTGCG
GAATCGAGCG GCACCCCGTA TACCGGCTCA CTGCAGCAAA AGCTTGACTC GATCGAAAAT
GAAATGATCA TCGAAGCGCT CAAACGCACA AAAGGAAATA TGTCACGGGC GGCTATACAA
CTCGGCCTCT CGGACAGAAT CATGGGGTTA CGGGTAAAAA AATTCAACAT CGACTATCGA
AAGTTCCGTA TATGA
 
Protein sequence
MLIPQKKKDS SISLLAEVSR TVTIEKDISK VLRLVLFIMS EHMDMLRGMI TILNRDNDEI 
VINESFGLSE EEKERGRYRI GEGIIGQVVK TGKPVLVPNI NDEPLFLDRT RSRQKERTDD
LCFICIPIKT GTEIIGTLSA DRQIEPPFPE DPSKRAKAES ERMDMMQHYV DLLSIIASMI
SQAVRLKQLA HEENSNGTGT THSLKGKNLL IPHRDNDSHE EEVDETERPA NIIGNAKPMM
SLFKMIDKIA KTSATTLVLG ESGVGKELVA SAIHFKSRRS DKPFIKFNCA ALPESIVESE
LFGHEKGSFT GASGMRQGRF ELAHTGTIFL DEIGELSLPV QAKLLRILQE KEFERVGGSK
TIKVDVRVIA ATNRNLENLI REGQFREDLF YRLNIFPLTV PPLRERKTDI LLLADYFVEK
YNRINQKGIR RISTTSIDML MRYHWPGNVR ELENCMERAV ILSEDNVIHG YHLPPSLQTA
ESSGTPYTGS LQQKLDSIEN EMIIEALKRT KGNMSRAAIQ LGLSDRIMGL RVKKFNIDYR
KFRI