Gene Lcho_0366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_0366 
Symbol 
ID6162689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp400365 
End bp401585 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content74% 
IMG OID641663115 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_001789406 
Protein GI171057057 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.000015812 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACGTCGA CCTCCCACGT CATCCCGACC CTGCTGCCCG GCCAGGCCTC GCCGGCTGCG 
CCGCAGCGCC TGGCCACGCT CGGCGTGATG GGCGGCGGCC AGCTCGGGCG CATGTTCGTG
CACGCCGCGC AGCAGCTCGG TTTCCGCACC GCGGTGCTCG AACCTGACGC CACCAGCCCG
GCCGGCCTGG TCGCCCACGC GCATGTGGTG AGCGACTACC TCGACGCCGC CGGGCTCGAT
CGCCTGGCCG ACGAGGCCGA CGCCATCACC ACCGAGTTCG AGAACGTGCC CGCCCAGGCG
CTGCGCCAGC TCGCCTTGCG CCGGCCGGTG GCGCCCGCGG GTGACGTGGT CGCGGTCTGC
CAGGACCGCG CCGCCGAGAA GGCGCATTTC GCCGCCAGCG GCGTGCCCTG CGCGCGCAAC
CACCTGATCG AGACCGAAGC CGACCTGGCT GCGGTCGACG CCGCGCTGCT GCCCGGCATC
CTCAAGACCG CGCGCCTCGG TTACGACGGC AAGGGCCAGC GCCGCGTGTC TGACCGCGCC
GAGCTGGCCG CCGCGTGGCG CGAGCTGCAA GGCGTGCCGT GCCTGCTCGA ACAGATGTTG
CCGCTGGCGC AGGAGCTGAG CGTGATCGTC GCCCGCTCGG CCACCGGCGA GCTGGTGCAC
CTGCCGGTGC AGCAGAACCT GCACCGCGAC GGCATCCTCG CCGTCACCGT CGTGCCCGCG
CCCGACATCG ATGCCGCGAC GCAGGCCGAA GCGGTGGCCG CGGCCGGCCG CATCGCGCAC
GAACTGGCCT ACGTGGGCGT GCTCTGCATC GAGTTCTTCG TGCTCGCCGA CGGCCGGCTG
GTGGCCAACG AGATGGCGCC GCGGCCGCAC AACTCCGGCC ATCACAGCGT CGACAGCTGC
GATATCTCGC AGTTCGAGCT GCAGGTGCGC ACGCTCGCCG GCCTGCCGCT GGCCGAGCCG
CGCCTGCATT CGAGCGCGGT GATGCTCAAT CTGCTGGGCG ACCTCTGGTT CGACGCCGCC
GCGGGTGCCG ATGCCGCCGA GCGCACGCCC GACTGGCGGC CGATCCTGGC GCTGCCCGGC
GCGCACCTGC ACCTGTACGG CAAGCAGTCG GCGCGGCGGG GCCGCAAGAT GGGCCACCTG
ACCTTCACCG CGGCCTCCCC GCAGGCCGCG CGTGATGTTG CCCTGCAGGC CGCGGCGCTG
CTCGGCATCG AGCCGTTCTG A
 
Protein sequence
MTSTSHVIPT LLPGQASPAA PQRLATLGVM GGGQLGRMFV HAAQQLGFRT AVLEPDATSP 
AGLVAHAHVV SDYLDAAGLD RLADEADAIT TEFENVPAQA LRQLALRRPV APAGDVVAVC
QDRAAEKAHF AASGVPCARN HLIETEADLA AVDAALLPGI LKTARLGYDG KGQRRVSDRA
ELAAAWRELQ GVPCLLEQML PLAQELSVIV ARSATGELVH LPVQQNLHRD GILAVTVVPA
PDIDAATQAE AVAAAGRIAH ELAYVGVLCI EFFVLADGRL VANEMAPRPH NSGHHSVDSC
DISQFELQVR TLAGLPLAEP RLHSSAVMLN LLGDLWFDAA AGADAAERTP DWRPILALPG
AHLHLYGKQS ARRGRKMGHL TFTAASPQAA RDVALQAAAL LGIEPF