Gene Clim_1142 details


Gene Information

Locus tag: Clim_1142
Symbol:
ID: 6353658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1239600
End bp: 1241102
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 57%
IMG OID: 642668759
Product: aminoacyl-histidine dipeptidase
Protein accession: YP_001943190
Protein GI: 189346661
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01893] aminoacyl-histidine dipeptidase
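
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are illustrative only and are not part of the source database.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
START_BP = 1239600
END_BP = 1241102


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1503
print(protein_length(length_bp))  # 500
```

Both results match the Gene Length and Protein Length fields of the record.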


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0672748
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTACAG ATATTCGCGA GCTTGACCCT CGGGAGGTGT GGAGGCATTT TCACAGCCTT 
ACCCGGATTC CTCGTCCTTC GGGGCACGAA GAGAAAGTCA GGGCGTTCAT CGCAGGTTTC
GGACGGAGTC TCGGCCTGGA TACAACGGTC GATGAAGCGG GAAACGTCAT TATCCGCAAA
CCCGCTACAT CCGGAATGGA AGAGTATCGG GGGATTATTC TGCAGGCCCA TCTCGACATG
GTGCCGCAGA AAAACGGCGG TACGTTGCAT GATTTTGAAA CCGATCCTAT CGAACCCATT
GTCGATGGCG GGTGGGTGCG AGCCCGCGGT ACCACGCTTG GCGCCGATAA CGGTATTGGC
GTGGCGGCGG CCATGGCCGT ACTGGAATCA GGCGACTTGC GGCACGGTCC GCTGGAAGCG
CTTTTCACGT CCGAAGAGGA GAGCGGTATG GCCGGAGCCT TGCGACTGAA ACCCGGCATG
CTCAAAGGCG GAATACTTCT GAACCTTGAT TCTGAAGACG AGGGGGAGCT GTTCATCGGT
TGTGCGGGCG GCCTTGATGC AACAATGACT TTCAGCTATG ACGAACAGGT CGTTCCTGCC
GGTTACGAAG GGTACATGCT CAGGGTGAAC GGGTTGCGTG GCGGACACAG CGGCATGGAT
ATTCATCTCG GCCGAGGTAA CGCCAACAAA ATCATGAACC GGTTGCTGCA TCAGGGGTAC
CTGCGTCATG GAATGCTGAT CGGTGCGATC GAGGGCGGAA CCCTCCGCAA CGCCATTCCG
CGTGAATCCT CGGCTCTGGT GGTCGTGCCT GCCTTGCAGA GGGATGGGTT TCTTGATGGG
CTTGGCCGAC TTGGCGCTGA TATAAAAAAC GAGCTTGCTT CCGCCGATCC CGGAGTGAGG
ATTGAAGCGG TCTCTGCCGC GTTACCGGAG CTGGTTATCG GGGAGCCTGT TGCTGAGAGG
ATGCTCAGGG CGATCCACGC CTGTCCGGAC GGGGTGATGC GCATGAGCTG CGAGATGGCC
GGTGTGGTAG AAACTTCCAG CAATCTTGCA ATCGTCACCT CCAGCGACGG AGAAATTACT
GTTCAGTGCC TGCTCCGCAG TTCGGTGGAT TCCGCCCTCG AGGAGCTTGC GACAATGATC
GGCAGCGTGT TCGAACTGGC CGGAGCGGTT GCGGTATTCG ACGGAGGCTA TCCCGGCTGG
AAACCCGATC CCGGATCTCC GGTTCTGAAA GGCATGCTGG AGATCTACCA TGAAAAATTC
GGGACGACTC CGGAAGTCAA GGCTGTGCAT GCCGGTTTGG AATGCGGAAT CATCGGAAGC
ATCTACAGCG AGATCGATAT GATCTCCTTT GGTCCCACAA TCCGTTATCC GCACTCGCCG
GATGAAAAAG TGGAAATCGC ATCGGTCGAA AAATTCTGGG ATTTTCTCGT TGAGACAATC
GGCAGGGTTT CTTCAGGGTC AGGAGGCTGT GGTTTCATCC CGGCAGGACG AACCCATCGG
TGA
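
The GC content reported in the gene information (57%) can in principle be recomputed from the gene sequence above. The following is a minimal sketch: `gc_percent` is a hypothetical helper, not a function of the source database, and it assumes the sequence is supplied as plain text in the space-separated 10-base blocks used on this page. It is demonstrated on a short toy string rather than on the full record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input: 8 bases, 6 of which are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence, with its line breaks and spaces left in place, to `gc_percent` gives a value to compare against the reported figure.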
 
Protein sequence
MSTDIRELDP REVWRHFHSL TRIPRPSGHE EKVRAFIAGF GRSLGLDTTV DEAGNVIIRK 
PATSGMEEYR GIILQAHLDM VPQKNGGTLH DFETDPIEPI VDGGWVRARG TTLGADNGIG
VAAAMAVLES GDLRHGPLEA LFTSEEESGM AGALRLKPGM LKGGILLNLD SEDEGELFIG
CAGGLDATMT FSYDEQVVPA GYEGYMLRVN GLRGGHSGMD IHLGRGNANK IMNRLLHQGY
LRHGMLIGAI EGGTLRNAIP RESSALVVVP ALQRDGFLDG LGRLGADIKN ELASADPGVR
IEAVSAALPE LVIGEPVAER MLRAIHACPD GVMRMSCEMA GVVETSSNLA IVTSSDGEIT
VQCLLRSSVD SALEELATMI GSVFELAGAV AVFDGGYPGW KPDPGSPVLK GMLEIYHEKF
GTTPEVKAVH AGLECGIIGS IYSEIDMISF GPTIRYPHSP DEKVEIASVE KFWDFLVETI
GRVSSGSGGC GFIPAGRTHR
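
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table by hand. Table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons, so alternative start codons are not handled here. The `translate` helper is illustrative and is checked only against the first 60 bases of the gene.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = (
    "ATGAGTACAG ATATTCGCGA GCTTGACCCT CGGGAGGTGT GGAGGCATTT TCACAGCCTT"
)
print(translate(first_line))  # MSTDIRELDPREVWRHFHSL
```

The output matches the first 20 residues of the protein sequence (MSTDIRELDP REVWRHFHSL).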