Gene Lcho_3469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_3469 
Symbol 
ID6159786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp3882930 
End bp3884231 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content69% 
IMG OID641666243 
Productintegrase family protein 
Protein accessionYP_001792490 
Protein GI171060141 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0423889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCGA CGAACAACCT CGACGACAAG AGCATCCGTG CCGCGATCAA GCGGGCGATG 
AAGGCGCAGG CCGGCGAGCG GCTGACCGAT GGCGATGGCT TGCGCCTGGA CGTGCAGCCG
ACCGGTTCTG CCTGGTGGCG CTGGCGCTAC CGCTTCGGCG GGAAGGAGGG GATGCTGTCT
CTCGGCACCT ACCCCGACAC GTCCCTGTCG GCAGCTCGCG GCAGGCGCGA CGAGGCGCGC
GAGCGGCTGG CAGCCGGGAT CAACCCGAGC GAGGCGCGCA AGGATGACAA GGCGGCCCAG
GCGCTGAAGG CTGAGGCTGC CCGCCTGGCC GCAGCGGGAT TGCCCGGGCC TGGCACGTTC
GAGCACGCGG CCCGGGAGTG GCATGCCCGC ATGGCGCCGA GCTGGTCGGA AGGGCACGCT
GGCAAGGTGC TGGCGCTACT GGTGAATGAC CTGTTCCCCT TCATCGGCAC GAGTGCGCTT
GCCGAGCTGA CCCCGCCCGA GCTGTTGAAG CACGCTCGGC GCATCGAGGC CCGCGGTGCG
GTCGAAACCG CATACCGGGC CCTGAAGGCG GCTGGCGCCG TGTTCCGCCA CGGCGTGCAG
AACGGCTACT GCGACAGCGA CCCCACGCGA GACCTGAAGG GCGCCATCGT GCTGCCCGTA
CCGGAGCATC GGGCTGCCAT CACCGACCCA GCCAGGTTGG GCGAACTGCT GCGGGCCATC
GACGGATACC AAGGCACGCC GGTTGTACGC TCCGCTCTCG CGCTGGCGCC GCTGGTTTTC
CTGCGACCGG GTGAGCTGCG CAAGGCCGAG TGGGCAGAGT TCGACCTCGA CGCAGCCGTG
TGGACCATCC CGGCTGCGCG CATGAAGGGG CGGTTGAAAG CCAAGCTCAA CGGCCCCGAT
CATGTGGTGC CGCTGGCGCC ACAGGCGGCG GCAATCCTGC GCGACCTGCA ACCGCTGACG
GGTGCCGGCA AGTACGTGTT CCCGAATCCG CTCACGCCCG ACCGTCCGCT ATCCGACAAC
GGTGTGCTGT CAGCGCTGCG CCGGATGGGC TTCGACAAGG ACGAGATGAC GGGCCACGGA
TTCCGTGCCA CGGCGCGGAC CATCGCGGCC GAGCGGCTGA AGATCGACCC CGTGGTGCTC
GAAGCGCAGC TTGCGCACGT AGTGGCCGAT GCGCTGGGCC GGGCCTACAA CCGTACGCAG
TACCTCGACC AGCGCCGCGA CATGATGACC CGCTGGGCCG ACTACCTGGA CCGCCTGCGC
AAGGGCGCAG AGGTGGTCGA CCTGACAAGC AAGCGGGCCT GA
 
Protein sequence
MAATNNLDDK SIRAAIKRAM KAQAGERLTD GDGLRLDVQP TGSAWWRWRY RFGGKEGMLS 
LGTYPDTSLS AARGRRDEAR ERLAAGINPS EARKDDKAAQ ALKAEAARLA AAGLPGPGTF
EHAAREWHAR MAPSWSEGHA GKVLALLVND LFPFIGTSAL AELTPPELLK HARRIEARGA
VETAYRALKA AGAVFRHGVQ NGYCDSDPTR DLKGAIVLPV PEHRAAITDP ARLGELLRAI
DGYQGTPVVR SALALAPLVF LRPGELRKAE WAEFDLDAAV WTIPAARMKG RLKAKLNGPD
HVVPLAPQAA AILRDLQPLT GAGKYVFPNP LTPDRPLSDN GVLSALRRMG FDKDEMTGHG
FRATARTIAA ERLKIDPVVL EAQLAHVVAD ALGRAYNRTQ YLDQRRDMMT RWADYLDRLR
KGAEVVDLTS KRA