Gene Clim_0716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0716 
Symbol 
ID6354330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp789106 
End bp790680 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content51% 
IMG OID642668343 
ProductIntegrase catalytic region 
Protein accessionYP_001942778 
Protein GI189346249 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.323373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGACAC AGTTAAAAAA GCTAACCATG TACAACAAAG TTAAGGAATT TGCCCGAGAA 
GGATTAAGCA TCCGTCAAAT CAGCCGAAAG ACGGGCATGG ACAGAGTGAC GGTGCGCAAG
TTTCTCCGCA TGACCGATGA GGAATTCAGT GCGTTTCTTG CTCTGCAGAA GCGGCGCCTG
CGAAAATTGC AGCCTTATGA ACAGTTCGTC AAGGATAGGG TTACCGACTA TCCTGACTGC
AGTGCAACTC AAGTTGAAGA CTGGCTGAAG GAGCATCACC CGGACTTTCC AGAGGTAACG
ACTCGAACGA TCTATTCTTT TGTCCAGTGG ATCCGAAAAA CCTATGATCT TCCAAAACCG
AAAGGAACCC CTCGTGCCTA TCATCCGGTC GAGCAACTTC CTTACGGAGA GCAGGCGCAG
GTTGATTTCG GTGAGTACTG GATGGCGAGT GCTGATGAAC ACAACGTGAA GGTTCACTTC
ATGATTATGC TGCTCTCCCG AAGCCGCAGG AAGTTTGTCA GCTTCAGCCA GCAACCGATT
ACGACCCGTT TTGTGCTTGA AGCTCATGAA CAGGCATTTG CCTTTTTTGA GGGCATACCG
CACACACTGG TTTATGATCA GGACTCAACC ATTGTTTCCG ATGAGAACCG GGGTGCCATC
CTTTATACGG AGGCGTTCAG GAAGTACCTG TTGCACCGCA GTCTGAAGAT CCATCTCTGT
CGGAAAAGCG ATCCGGAAAG CAAAGGGAAA ATCGAAGCCG GCGTCAAATA TGTGAAGTAC
AACTTCCTGC CGGGGCGACG CTTCGTCAAT CTTGAAGTCC TGAACCAGGA AGCGTTGCTC
TGGCTTGAAC GAACGGCCAA TGCCAAAGAA CATGCCACAA CGCGGCTGAT ACCTGAGGCA
GAATGGCAGG TGGAAAAACA GCATCTTCGT CCTTTTGAGC CCTTACCCTA TCCGATTTCC
GGGCCTGTCG GTAAAGAGTA CCATGTACGC AAAGACAACA CAATCTCGTA TCGAGGGAAT
TTCTATAGCC TGCCGGTCGG CACCTATGCA GGGCCGGGGA CACTGGTTGT GCTGGAAGTC
AGGCAGAACA CCCTTTGTCT CTATGCTCAT GACGGCAGGT TGCTGGCCAA TCACCCGATT
AAGAGCGGCA AAGGTACCGT GGTGGTCAAC AACCACCACC GACGCGATAC TTCCGCCAAA
CGGCGAGAGT TGCAGGACTC GCTCAAGCCG CTTTTCACCA ATCAGGAACA GGCGGAACTG
TTTCTTGAAA GCATCCACAA CCGTTATCCC CGGTACAGTC GGGACCAGTT CCTGCATGTA
CGCAATACCA TCAGCGGATG CCGGCAGAAG CTGATAGATG AGGCCCTCGC ATACTGTGTC
GATCATCATC TCTTTTCATC CGGTGAGTTC CATGATATCC TGCACCATTA CCGAAAGCGG
GAAGAAAAAC AGAGTCATCC GACGGTCTCC AACACCTTCC GCCCGAAAAC ACGCCGAAGC
GACCTGAACA GGATGCTCTC GTTCGTGCCG GACAGCAGTA CCATAACCAC CTATGAAACC
ATTTTCAGCT GTTAA
 
Protein sequence
MRTQLKKLTM YNKVKEFARE GLSIRQISRK TGMDRVTVRK FLRMTDEEFS AFLALQKRRL 
RKLQPYEQFV KDRVTDYPDC SATQVEDWLK EHHPDFPEVT TRTIYSFVQW IRKTYDLPKP
KGTPRAYHPV EQLPYGEQAQ VDFGEYWMAS ADEHNVKVHF MIMLLSRSRR KFVSFSQQPI
TTRFVLEAHE QAFAFFEGIP HTLVYDQDST IVSDENRGAI LYTEAFRKYL LHRSLKIHLC
RKSDPESKGK IEAGVKYVKY NFLPGRRFVN LEVLNQEALL WLERTANAKE HATTRLIPEA
EWQVEKQHLR PFEPLPYPIS GPVGKEYHVR KDNTISYRGN FYSLPVGTYA GPGTLVVLEV
RQNTLCLYAH DGRLLANHPI KSGKGTVVVN NHHRRDTSAK RRELQDSLKP LFTNQEQAEL
FLESIHNRYP RYSRDQFLHV RNTISGCRQK LIDEALAYCV DHHLFSSGEF HDILHHYRKR
EEKQSHPTVS NTFRPKTRRS DLNRMLSFVP DSSTITTYET IFSC