Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1387 |
Symbol | |
ID | 6375065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 1504369 |
End bp | 1505850 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642683882 |
Product | aminoacyl-histidine dipeptidase |
Protein accession | YP_001959796 |
Protein GI | 189500326 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01893] aminoacyl-histidine dipeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0727929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000123065 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGGAAATG ATATCCTGCA ACTGAAACCA CAGGAGGTGT GGAGACTTTT TTACAGTCTG ACCCGTATTC CGCGTCCTTC AGGACATGAG GATGCAGTCA GGGAGTTTAT CGCCACTTTC GGGAAAAATC TCGGGCTGAA AACAATGGTA GACAGGGTTG GAAATGTCAT TATCCGCAAA CCTGCAACAC CCGGCATGGA GAACCGCAAG GGAGTTATTC TGCAGGCCCA TCTCGACATG GTGCCGCAGA AAAACAGCGG AACAGAGCAT GATTTTTATA ACGATCCCAT CGACATCATT GTTGACGGGG AGTGGGTTTG TGCCAGAGGT ACTACGTTAG GGGCCGATAA CGGTATCGGC GTCGCGGCGA TTATGGCTGT GCTTGAATCA GGCCCGATGA GGCATGGTCC TCTTGAAGCG CTGTTTACCA GCAATGAGGA GAGCGGCATG ACCGGAGCTT TCGGTCTCAA ACCCGGTCTG CTCCAGGGAA AGATCCTGAT GAACCTGGAT TCCGAACAGG AAGGTGAACT CTCAATCGGT TGCGCAGGAG GGCTTGACGC AACCATGACC TTCAGATATT CTGAAAAACC TGTTCCGGAT GGGTATATCG GGTTCAGTAT AGCGGTAACC GGTCTGAAGG GTGGTCACAG CGGTGTGGAT ATCCACCGAG GCCGCGGGAA CGCCAACAAG ATCCTGAACC GTCTTCTCTA TGAGGGATAT ACCCGCCATG GAATGCTCCT GAGCGCTATT CACGGCGGAA GCCTTCGCAA TGCGATACCA CGTGAAGCGA CGGCCATGGT TGCTGTGTCT GCCAGGCAGG CCGAACGGTT TCTTGAAGAC ACATACAGGT TGGCAGCAAA AATCAAACAA GAGTTTTCGA CGGCTGATCC TGATCTTGAG ATCGTCGCTG TTTCTGTGGG AGTCCCGGAC AGGGTCATTG AGGAACGTGT TGCGGTCAAT CTGTTCAACA CGCTCTGTGC CTGCCCGAAC GGGGTGATGC GCATGAGCGA CGAGATGGAA GGGCTGGTTG AAACATCCAA CAACCTTGCC GTGGTCAAAT CCGGTGAAGG GGCCATGACC GTGGAGTGTC TGCTGAGAAG TTCCGTGGAT TCCGCTCGGG AAGAACTGGA GAATATGATC AGGAGCATTT TTCAGCCTGC CGGAGCGGTT TGCCTCTTCG ATGGGGCATA TCCCGGCTGG AAACCGAACC CGGGTTCACC GGTATTGCAG AGCATGACGG AAATCTACAC GAAAATGTTT GGAGAAGCAC CGGAAATCCG TGCGGTTCAC GCCGGTCTGG AATGTGGTAT CATCGGCGCG ACCTATCCCG GACTCGACAT GATCTCTTTC GGGCCTACCA TTCTGTATCC TCATTCTCCT GATGAAAAGG TGAGCAGTAC TTCCGTCCTG CGATTCCGGG ATTTTCTGGT CGAAACACTT GCGCAACTGC CATTAGCGGC GCAGGGGAGC CGGTTGCAGT AA
|
Protein sequence | MGNDILQLKP QEVWRLFYSL TRIPRPSGHE DAVREFIATF GKNLGLKTMV DRVGNVIIRK PATPGMENRK GVILQAHLDM VPQKNSGTEH DFYNDPIDII VDGEWVCARG TTLGADNGIG VAAIMAVLES GPMRHGPLEA LFTSNEESGM TGAFGLKPGL LQGKILMNLD SEQEGELSIG CAGGLDATMT FRYSEKPVPD GYIGFSIAVT GLKGGHSGVD IHRGRGNANK ILNRLLYEGY TRHGMLLSAI HGGSLRNAIP REATAMVAVS ARQAERFLED TYRLAAKIKQ EFSTADPDLE IVAVSVGVPD RVIEERVAVN LFNTLCACPN GVMRMSDEME GLVETSNNLA VVKSGEGAMT VECLLRSSVD SAREELENMI RSIFQPAGAV CLFDGAYPGW KPNPGSPVLQ SMTEIYTKMF GEAPEIRAVH AGLECGIIGA TYPGLDMISF GPTILYPHSP DEKVSSTSVL RFRDFLVETL AQLPLAAQGS RLQ
|
| |