Gene Lcho_1107 details

Gene Information

Locus tag: Lcho_1107
Symbol:
ID: 6161308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 1182931
End bp: 1184439
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 75%
IMG OID: 641663861
Product: AraC family transcriptional regulator
Protein accession: YP_001790141
Protein GI: 171057792
COG category: [F] Nucleotide transport and metabolism
              [L] Replication, recombination and repair
COG ID: [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
        [COG2169] Adenosine deaminase
TIGRFAM ID:
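
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the record above; names are illustrative only.
START_BP = 1182931
END_BP = 1184439


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumed present)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1509 502
```

Both results match the Gene Length (1509 bp) and Protein Length (502 aa) fields.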


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0739915
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGTTT GCATGCCGAG CCGATCCGCC CCCCTCGCCA GCACCGACGC GACCGCGCAT 
GCCGCCTGTT ATGCGGCGCT GGTGGCGCGC GATCGGCGCT TCGACGGGCG CTTCTTCGTC
GGCGTCACCT CCACCGGCAT CTACTGCCGG CCGGTCTGCG GCGTGCGCAC GCCCAAGGCC
GCGAACTGCC GTTTCTTCGA CAACGCCGCC GCCGCCGAGG CGGGTGGATT CAGGCCCTGC
CTGCGCTGCC GGCCGGAACT GGCGCCGGGG CTCGCGGGCA TCGACATGCC CTCGCGCCTG
GCCTGGGCTG CGGCCCAGCG CATCGAGGCC GGCGCGCTCG ACGATGGCGG GCTGACCGGC
CTGTCGGCGC GGCTGGGCAT CACCGACCGC CACCTGCGCC GCATCTTCAT GGCCGCCTTC
GGCGTCACGC CGATCGACTA CGCCCAGACC CAGCGGCTGC TGATCGCCAA GCGCCTGCTG
GCCGACACCA CGCTGCCGGT CACCGAGGTG GCGCTGGCGG CGGGCTTCGG CAGCCTGCGG
CGCTTCAATC ACCTGTTTCA AAGCCGCTAC CGGCTCACGC CGGGTGAACT GCGGCGCGCC
AGCGGCCCGG CTGCGCGCGA GGGCGGGCTG CACTTCGAGC TGGCGTTCCG CCCGCCGCTC
GATTGGCCGC GGCTGCTGGC CTTCCTGGCG GCGCGCTGCG TGGCCGGTGT CGAGGCGGTG
GCCGACGGCA CCTATCGGCG CAGCGTGCGG CTGCACGCCG GCGGCAGCGA GCACACCGGC
TGGCTGGCGC TCGGCCTGGC CGCGCACGGC GAGGCGATCG CGGTCGACGT CGCGCCCAGC
CTCTTGCGCG TGCTGCCGGC GCTGCTGGCC GGCGTGCGCC GCCTGTGCGA CCTGTCGTGC
GACCCGCAGG CGGTGGCCGC GGTGCTCGGC CCGCTGGCTG CCGATGCGCC GGGTCTGCGC
GTGCCGGGCG CGTTCGACGG CTTCGAGATG GCGGTGCGGG CCGTGCTCGG CCAGCAGGTC
ACGGTCAAGG CAGCGCACAC GCTGGCCGGC CGAATGGCGG CCGCGTTCGG CGAGCCGCTG
CGATCGGAGC AGTCGTTGGA CGGCGTCGGC CTGCTGTTCC CGACGCCGCA GCGGCTGGCG
TCGGCCGGCG CCGAGCAGAT CGCCGCGCTG GGTATCGTGC GCAGCCGCGC CGATGCGTTG
ATCGCGCTGG CGCAGGCGGT GTCGAGCGGC GAGATCGATC TCGGCCCGGC CGCCGACGTG
GCCCTGTGCA CCGAGCGCCT GCAAGCCCTG CCGGGCATCG GCCGCTGGAC CGCGCAGTAC
ATTGCGCTGC GCGCGCTCGG CTGGCCCGAC GCCTGGCCGA GCGGCGACGT CGCACTGATC
AAGGCGCTCG GCGCCGCCGG CCCGCGCGAG GCCGATGCGC TGGCCGAAGC CTGGCGCCCC
TGGCGCAGCT ACGCCACCGT GCAACTCTGG CGCCGGCTCG CCGAAGCGGC CACCGCCCCG
AAACCATGA
 
Protein sequence
MIVCMPSRSA PLASTDATAH AACYAALVAR DRRFDGRFFV GVTSTGIYCR PVCGVRTPKA 
ANCRFFDNAA AAEAGGFRPC LRCRPELAPG LAGIDMPSRL AWAAAQRIEA GALDDGGLTG
LSARLGITDR HLRRIFMAAF GVTPIDYAQT QRLLIAKRLL ADTTLPVTEV ALAAGFGSLR
RFNHLFQSRY RLTPGELRRA SGPAAREGGL HFELAFRPPL DWPRLLAFLA ARCVAGVEAV
ADGTYRRSVR LHAGGSEHTG WLALGLAAHG EAIAVDVAPS LLRVLPALLA GVRRLCDLSC
DPQAVAAVLG PLAADAPGLR VPGAFDGFEM AVRAVLGQQV TVKAAHTLAG RMAAAFGEPL
RSEQSLDGVG LLFPTPQRLA SAGAEQIAAL GIVRSRADAL IALAQAVSSG EIDLGPAADV
ALCTERLQAL PGIGRWTAQY IALRALGWPD AWPSGDVALI KALGAAGPRE ADALAEAWRP
WRSYATVQLW RRLAEAATAP KP
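
The sequences above are printed in space-separated blocks of 10 residues, so the grouping has to be stripped before any analysis. The following is a minimal sketch using only the Python standard library. It removes the whitespace, computes a GC percentage, and runs a basic sanity check for a complete CDS. The helper names are hypothetical, and the start and stop codon sets are an assumption: a commonly used subset for translation table 11, not the full table.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(
    dna: str,
    starts=("ATG", "GTG", "TTG"),   # assumed subset of table 11 start codons
    stops=("TAA", "TAG", "TGA"),    # standard stop codons, also used by table 11
) -> bool:
    """True if the length is a multiple of 3 and the ends are start/stop codons."""
    seq = clean_sequence(dna)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in starts
        and seq[-3:] in stops
    )


# Small illustrative inputs; paste the full gene sequence to analyse the record.
print(clean_sequence("ATGATCGTTT GCATGCCGAG"))
print(gc_percent("ATGC"))
print(looks_like_complete_cds("ATG AAA CCA TGA"))
```

Applied to the full gene sequence, `gc_percent` can be compared against the GC content field, and `looks_like_complete_cds` confirms that the sequence starts with ATG and ends with the TGA stop codon.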