Gene Cphamn1_1387 details


Gene Information

Locus tag: Cphamn1_1387
Symbol:
ID: 6375065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1504369
End bp: 1505850
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 54%
IMG OID: 642683882
Product: aminoacyl-histidine dipeptidase
Protein accession: YP_001959796
Protein GI: 189500326
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01893] aminoacyl-histidine dipeptidase
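
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1504369, 1505850)
print(length)                     # 1482
print(protein_length_aa(length))  # 493
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of the record.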


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0727929
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000123065
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGGGAAATG ATATCCTGCA ACTGAAACCA CAGGAGGTGT GGAGACTTTT TTACAGTCTG 
ACCCGTATTC CGCGTCCTTC AGGACATGAG GATGCAGTCA GGGAGTTTAT CGCCACTTTC
GGGAAAAATC TCGGGCTGAA AACAATGGTA GACAGGGTTG GAAATGTCAT TATCCGCAAA
CCTGCAACAC CCGGCATGGA GAACCGCAAG GGAGTTATTC TGCAGGCCCA TCTCGACATG
GTGCCGCAGA AAAACAGCGG AACAGAGCAT GATTTTTATA ACGATCCCAT CGACATCATT
GTTGACGGGG AGTGGGTTTG TGCCAGAGGT ACTACGTTAG GGGCCGATAA CGGTATCGGC
GTCGCGGCGA TTATGGCTGT GCTTGAATCA GGCCCGATGA GGCATGGTCC TCTTGAAGCG
CTGTTTACCA GCAATGAGGA GAGCGGCATG ACCGGAGCTT TCGGTCTCAA ACCCGGTCTG
CTCCAGGGAA AGATCCTGAT GAACCTGGAT TCCGAACAGG AAGGTGAACT CTCAATCGGT
TGCGCAGGAG GGCTTGACGC AACCATGACC TTCAGATATT CTGAAAAACC TGTTCCGGAT
GGGTATATCG GGTTCAGTAT AGCGGTAACC GGTCTGAAGG GTGGTCACAG CGGTGTGGAT
ATCCACCGAG GCCGCGGGAA CGCCAACAAG ATCCTGAACC GTCTTCTCTA TGAGGGATAT
ACCCGCCATG GAATGCTCCT GAGCGCTATT CACGGCGGAA GCCTTCGCAA TGCGATACCA
CGTGAAGCGA CGGCCATGGT TGCTGTGTCT GCCAGGCAGG CCGAACGGTT TCTTGAAGAC
ACATACAGGT TGGCAGCAAA AATCAAACAA GAGTTTTCGA CGGCTGATCC TGATCTTGAG
ATCGTCGCTG TTTCTGTGGG AGTCCCGGAC AGGGTCATTG AGGAACGTGT TGCGGTCAAT
CTGTTCAACA CGCTCTGTGC CTGCCCGAAC GGGGTGATGC GCATGAGCGA CGAGATGGAA
GGGCTGGTTG AAACATCCAA CAACCTTGCC GTGGTCAAAT CCGGTGAAGG GGCCATGACC
GTGGAGTGTC TGCTGAGAAG TTCCGTGGAT TCCGCTCGGG AAGAACTGGA GAATATGATC
AGGAGCATTT TTCAGCCTGC CGGAGCGGTT TGCCTCTTCG ATGGGGCATA TCCCGGCTGG
AAACCGAACC CGGGTTCACC GGTATTGCAG AGCATGACGG AAATCTACAC GAAAATGTTT
GGAGAAGCAC CGGAAATCCG TGCGGTTCAC GCCGGTCTGG AATGTGGTAT CATCGGCGCG
ACCTATCCCG GACTCGACAT GATCTCTTTC GGGCCTACCA TTCTGTATCC TCATTCTCCT
GATGAAAAGG TGAGCAGTAC TTCCGTCCTG CGATTCCGGG ATTTTCTGGT CGAAACACTT
GCGCAACTGC CATTAGCGGC GCAGGGGAGC CGGTTGCAGT AA
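
The GC content field of the record can be recomputed from a sequence printed in this blocked layout. This is a minimal sketch, assuming GC content is simply the percentage of G and C bases over all bases; the helper name `gc_content` is hypothetical, and how the source database rounds the value is not stated.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base blocks and line breaks used above) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


print(gc_content("ATGC"))  # 50.0
```

Passing the full gene sequence as one string gives a value that can be compared with the GC content reported in the gene information.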
 
Protein sequence
MGNDILQLKP QEVWRLFYSL TRIPRPSGHE DAVREFIATF GKNLGLKTMV DRVGNVIIRK 
PATPGMENRK GVILQAHLDM VPQKNSGTEH DFYNDPIDII VDGEWVCARG TTLGADNGIG
VAAIMAVLES GPMRHGPLEA LFTSNEESGM TGAFGLKPGL LQGKILMNLD SEQEGELSIG
CAGGLDATMT FRYSEKPVPD GYIGFSIAVT GLKGGHSGVD IHRGRGNANK ILNRLLYEGY
TRHGMLLSAI HGGSLRNAIP REATAMVAVS ARQAERFLED TYRLAAKIKQ EFSTADPDLE
IVAVSVGVPD RVIEERVAVN LFNTLCACPN GVMRMSDEME GLVETSNNLA VVKSGEGAMT
VECLLRSSVD SAREELENMI RSIFQPAGAV CLFDGAYPGW KPNPGSPVLQ SMTEIYTKMF
GEAPEIRAVH AGLECGIIGA TYPGLDMISF GPTILYPHSP DEKVSSTSVL RFRDFLVETL
AQLPLAAQGS RLQ
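
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is self-contained and uses no external library; it assumes the sequence is given 5' to 3' on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGGAAATG ATATCCTGCA ACTGAAACCA CAGGAGGTGT GGAGACTTTT TTACAGTCTG"
print(translate(first_line))  # MGNDILQLKPQEVWRLFYSL
```

The output matches the first 20 residues of the protein sequence. Applied to the last line of the gene sequence, the same function yields the C-terminal residues AQLPLAAQGSRLQ and stops at the TAA codon.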