Gene Cphamn1_0904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0904 
Symbol 
ID6374571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp978325 
End bp979818 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content49% 
IMG OID642683406 
Productanthranilate synthase component I 
Protein accessionYP_001959330 
Protein GI189499860 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACATA CAGGTCGTTC GACCGCTTTT ATACTGAATC CTCTCATTAA AACCGTACAT 
GCGGATACGG AAACCCCTGT TTCAGTATAC CTGAAACTCA GGGATGCCTA TTCCTGTCTG
CTCGAGTCCG TTGAAGGCGA AGAGAAACTC GCCAGATTTT CCTATATCGC CATTGATCCG
GTTGCAGTTC TGAAAGGTTC CGTTGACGGA GCCACGACAC TTACTATCAG GGATGAACGT
TTCAGGAGTC TTGAAAGGAT TATCGAACAG GAACATGATC TGAGGAAAGT GATCGACAGT
TGTCTCGATT CGTTCGGATC ATCCCAGCGA ACCGAAAGCA TTGTAGGGAC ATCCCCGATG
ATCACCTCAG GCGCGTTCGG TTACTTCAAC TATGATACGA TGCATCTTGT CGAACGGATA
CCTCTGGCCC GTAAACCTGA CCCGTCGGGC CTTCCTGATG TCTGCCTGAT GTTCTGCGAC
AAGCTGGTTG TCTTTGATAA TGTTAAGCGC AAGGTGTTTG TCATTGTCAA CTACCTTGAG
GATGAGGATC GAAAACGTGC TGAAGCGACG ATTGAATCCA TTTGCGCAAA AATGTTTCTG
CCGCTTGATA CGGAGCTCAT GCGGCTGGTT CCTGAAAAGA AAGAAGCGGT TGTTTCCAAT
ACCCGAAAAG AGGAGTATCT CGACAAAGTA GATGTTGCCA AGGAGTATAT CAGGGCGGGC
GATATTTTTC AGGTTCAGGT CTCGCAGAGA CTGCGGCGGC CGTTGAATTC AAGAGCGTTT
GATGTCTACC GCATGTTGCG TACCATAAAC CCGTCGCCCT ACCTGTATTA TCTCGATCTG
GGTGATTTTG AAATTGTCGG TTCATCTCCG GAATTGCTGG TCAAGGTCAG CGCCGACGCG
AAAGGGCGGA GGATCGTCGA TACCCGGCCG ATAGCAGGAA CAAGGCCCAG GGGGAAAACC
TATGAAGAAG ACGCGCAGAT CGAAAAAGAG CTGCTTTCGG ATGAAAAAGA GCTTGCTGAA
CATCTCATGC TGATTGACCT GAGCAGGAAT GATATCGGTA GAATAGCCAA GGTGGGAACG
GTCGAGACAA ACGAGATGAT GATCATCGAA CGGTATTCTC ATGTTATGCA CATTGTCAGC
AATGTTCGTG GTGAACTGCA GGACGACTAC AGCCCGATGG ACGCATTCTG GGCATGTTTT
CCCGCAGGGA CATTGACCGG TGCCCCCAAG GTCAGGGCGA TGGAGATCAT CTGCGAACTC
GAAGAGGAAA AAAGAGGTCT GTATGGCGGG GCTGTCGGGT TTATTGATTT TCGGGGGGAG
CTTGAAACTG CTATCGCGAT AAGAACGATG GTGGTCAGAG ACAATGTTAT TTATTTTCAG
GCGGCGGGAG GTATTGTCGC CGACTCTGTC CCTCTGAATG AATTTGAAGA AACCATGAAC
AAGATGAGAG CCGGATTGCG GACTGTGGAA GCGCTTGAAG AATATCGGGG GTAA
 
Protein sequence
MSHTGRSTAF ILNPLIKTVH ADTETPVSVY LKLRDAYSCL LESVEGEEKL ARFSYIAIDP 
VAVLKGSVDG ATTLTIRDER FRSLERIIEQ EHDLRKVIDS CLDSFGSSQR TESIVGTSPM
ITSGAFGYFN YDTMHLVERI PLARKPDPSG LPDVCLMFCD KLVVFDNVKR KVFVIVNYLE
DEDRKRAEAT IESICAKMFL PLDTELMRLV PEKKEAVVSN TRKEEYLDKV DVAKEYIRAG
DIFQVQVSQR LRRPLNSRAF DVYRMLRTIN PSPYLYYLDL GDFEIVGSSP ELLVKVSADA
KGRRIVDTRP IAGTRPRGKT YEEDAQIEKE LLSDEKELAE HLMLIDLSRN DIGRIAKVGT
VETNEMMIIE RYSHVMHIVS NVRGELQDDY SPMDAFWACF PAGTLTGAPK VRAMEIICEL
EEEKRGLYGG AVGFIDFRGE LETAIAIRTM VVRDNVIYFQ AAGGIVADSV PLNEFEETMN
KMRAGLRTVE ALEEYRG