Gene Cpha266_2733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2733 
Symbol 
ID4569977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp3124050 
End bp3125063 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content53% 
IMG OID639767301 
Productaminodeoxychorismate lyase 
Protein accessionYP_913141 
Protein GI119358497 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.269056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTATCCG ATAAATCATC TGCACAGAAA ACGCTATTTG CAGGGCTGTT CCTGCTCTTA 
CTGTCGCTTT GCGGAATATT TTTCATTCCG GGCCTGAATA CCGCCGGGAT GCCGACCCGG
CTTGTCGTCC ACAGGGGTTC GGGGTTTATG GCTATTGCCG ACACCCTTCG CAGGAACGAA
GCCATTAAAA ACCGCTGGCA GGTTGTGCTT ACCGGCAGAA TGATTCCCGG ATTGCACAAG
ATCAAACCCG GCAGGTATTC CATTCCCCCC GGGTTGTCGA ACTTCGGGCT GTTGCGATAC
CTGCATACAC ACCATCAGGA TGAAGTCCGC ATCACCATTC CGGAAGGTCT GGAGCAACGG
GAAATTGCCA GGAGGATGGC GGGAAAACTC GATATGGACT CTTCCCGCTT CATGAAGGCG
GCAAAAAACG CCGCGCTGCT GTCGAAATAC CGGATATCCG CCCAAAGCGC TGAAGGCTAT
CTGTTCCCCG GTACGTATGA TTTCGCATGG GGCAGTACGC CCGATGAGGT CGCAGGGTTC
CTTATCAGCC GGTTCAGACT GTTTTATTCC GACTCTCTTC AACGCGCGGC GGCGTCAAAA
GGTCTGACTG AGACAAGCCT GCTGACCCTC GCTTCGATCG TTGAGGCAGA AACCCCTCTC
GACGAGGAAA AACCTCTTGT TGCCGGCGTC TATCTCAACC GGTTAAAAAA AGGCATGCGC
CTGCAGGCCG ATCCGACCGT TCAATACGCT CTTGACGGAC CTCCGCGCCA TCTTTATTAC
AAGGATCTTG CCATTGATTC TCCCTATAAT ACCTATCGCT ACGGCGGTCT GCCGCCAGGA
CCGATCTGTA ATCCCGGAAC GGCATCGATA CTTGCCGTTC TCAATCCCGA AGAAACCGGG
TTCATCTACT TTGTCGCAAC AGGAAAAGGT GGTCACTATT TTGCTGAAAC CATCGCTGCG
CATCACGAAA ACATCAGAAA ATACAAGGCG GCCAAGCATG CGTCATTACC CTGA
 
Protein sequence
MLSDKSSAQK TLFAGLFLLL LSLCGIFFIP GLNTAGMPTR LVVHRGSGFM AIADTLRRNE 
AIKNRWQVVL TGRMIPGLHK IKPGRYSIPP GLSNFGLLRY LHTHHQDEVR ITIPEGLEQR
EIARRMAGKL DMDSSRFMKA AKNAALLSKY RISAQSAEGY LFPGTYDFAW GSTPDEVAGF
LISRFRLFYS DSLQRAAASK GLTETSLLTL ASIVEAETPL DEEKPLVAGV YLNRLKKGMR
LQADPTVQYA LDGPPRHLYY KDLAIDSPYN TYRYGGLPPG PICNPGTASI LAVLNPEETG
FIYFVATGKG GHYFAETIAA HHENIRKYKA AKHASLP