Gene Clim_1674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1674 
Symbol 
ID6353981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1840421 
End bp1841911 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content49% 
IMG OID642669279 
Productanthranilate synthase component I 
Protein accessionYP_001943695 
Protein GI189347166 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00452514 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTATAC CGTTGAGGAG CCCTTCGTTT CTTCTGAAGC CGCTCGTAAG AGAGGTTCAT 
GCCGATACGG AGACCCCTGT GTCAGTCTAT CTCAAACTGC AGCGCCCCTA TTCCTGTCTT
CTTGAATCCG TTGAGGGCGA AGAGCATCTT GCACGGTTTT CCTATATCGC CATTGATCCC
GTAGCTGTTC TGAAAGGTAC CGTCGGAGGA GCGGTATCCC TTGAGGTTTT CAATGAACAG
TTCGGTTCAC TCAGGAAGAT TGTCAATGAA GAGAAAGATT TAAGGAAGAT TATCGATTTA
TCTCTGCAGC AGTTTGACAC CGATGAAATT CAGGGCAGAA AAAACGGAAC GGACCAGATG
ATAACCTCAG GTGTGTTCGG TTATTTCAGT TACGATGCCA TGCATCTTGT TGAAAAAATA
CCTGCTGCGC TTCTGCCCGA TCCGGCAGGC ATGGATGATA TCGTGCTGCT TTTCTGCGAT
ACGCTTGTTG TGTTCGACAA CATCATGCGA AAGGTCTTTA TTATTGCTAA TTATCTCGAT
GAGAGCGGTG TTGCCCGGGC TGAAGACAAA ATTGATGCTA TCGCCGGTCA TATGCTGCGT
CCGCTTGGCT CCGAGGAGGT GCTGCTGAAA TCAGAAAAGC CGGAAGAGGT GGTTTCCAAT
ACCACAAGAG AGGAATATCT GGCAAAGGTG GATCAGGCGA AGGAGTATAT TCTTATGGGG
GATATTTTTC AGGTGCAGAT ATCCCAGCGC CTGCGCCGTC CCCTTCATAC CCGGCCGTTC
GATGTATACA GGATGTTGCG GACCATCAAC CCTTCGCCCT ATCTCTACTA TTTCGATCTG
GGAGAAGCGA AAATCGTTGG TTCTTCTCCC GAACTGCTCG TAAAGGTTCA TCATGACCCG
AATGGACGGC GGATGGTAGA TACCAGGCCT ATTGCCGGAA CCAGAAAGCG AGGAGCCACC
TTTGAGGAGG ACGAACTCAT TGCAGCGGAA CTGCTCTCCG ATGAAAAGGA GTGCGCTGAA
CATCTCATGC TGATCGATCT GAGCCGGAAC GATATCGGAC GCATTGCCAA GGTCGGAACG
GTCGATACCA ATGAGATGAT GATCATTGAA AAGTACTCGC ACGTCATGCA CATCGTCAGT
AACGTACGAG GTGAGCTCAG GGACGATCTC GGTACCATGG ATGCTTTCTG GTCATGTTTT
CCGGCAGGTA CACTGACCGG CGCACCAAAA GTGCGTGCCA TGGAGATTAT CTATGAGCTT
GAACACGAGA AGCGCGGATT GTATGGTGGT GCGGTTGGTT TTCTTGACTT CAAGGGAAAC
CTTACGACTG CGATTGCAAT ACGTACGATG GTTGTGGAGA ACGGGACGAT CTATTTTCAG
GCTGCTGGCG GGATTGTTGC CGACTCAAAA CCGGAAAGTG AATATGAAGA GACGATGAGC
AAGATGAGAG CCGGTTTGAC TGCTGTTGAG AATATTGAAG CTTTGCCGTA A
 
Protein sequence
MSIPLRSPSF LLKPLVREVH ADTETPVSVY LKLQRPYSCL LESVEGEEHL ARFSYIAIDP 
VAVLKGTVGG AVSLEVFNEQ FGSLRKIVNE EKDLRKIIDL SLQQFDTDEI QGRKNGTDQM
ITSGVFGYFS YDAMHLVEKI PAALLPDPAG MDDIVLLFCD TLVVFDNIMR KVFIIANYLD
ESGVARAEDK IDAIAGHMLR PLGSEEVLLK SEKPEEVVSN TTREEYLAKV DQAKEYILMG
DIFQVQISQR LRRPLHTRPF DVYRMLRTIN PSPYLYYFDL GEAKIVGSSP ELLVKVHHDP
NGRRMVDTRP IAGTRKRGAT FEEDELIAAE LLSDEKECAE HLMLIDLSRN DIGRIAKVGT
VDTNEMMIIE KYSHVMHIVS NVRGELRDDL GTMDAFWSCF PAGTLTGAPK VRAMEIIYEL
EHEKRGLYGG AVGFLDFKGN LTTAIAIRTM VVENGTIYFQ AAGGIVADSK PESEYEETMS
KMRAGLTAVE NIEALP