Gene Phep_2747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2747 
Symbol 
ID8253855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3244545 
End bp3245795 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content50% 
IMG OID644936395 
ProductRibulose-bisphosphate carboxylase 
Protein accessionYP_003093010 
Protein GI255532638 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.32552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGAA TAACCGCAAA ATATTACATA GAAACGCCGC TTGATCTCGA AAAATCGGCC 
CAGTTGCTGG CCGGCGAGCA AAGTTCAGGA ACCTTTATTG CCGTGCCGGG CGAAACCGAA
GAATTGAAAC AGCGCTTTGC CGCCAGGGTA GAAAGCATTA CCCCAATGGA TACAGCTAAT
GAACCCGCCA TACCAGGCGT ACTGTCTGCC GGCGGTAAAT ACCAGCGCGC TATGATCGAG
GTTTCCTGGT CGATAGAGAA TTTCGGCTAT AACCTGCCGG TGATGGTGTC TACCCTGCAG
GGAAATTTAT ATGAGCTGAC CCAGTTTACA GGGCTTAAGC TGATGGATCT GGAATTGCCA
GCTTCTTTTG CTACCGCCTT TAAAGGGCCT AAATTTGGCA TAGCCGGCTG CAGGAAACTG
ACAGGTGTTT ACAACAGGCC CCTGATCGGA ACCATCATCA AACCCAGCAT CGGCATGACG
CCGGAACAAA CGGCTGCTTT GGTAAATACC CTTGCCCTGG CTGGAATTGA TTTCATCAAG
GATGATGAAC TGCTGGGCTC ATCTGCCAAT TCCCCCTTTG ATAAACGGGT GGATGCCATT
ATGGAAGTGA TCAACAGACA TGCTGATCGC AGCGGAAAAA AAGTAATGTA TGCTTTTAAC
ATCAGCGATG ACATCGACCA GATGCAGCGC AATTACGAAA AGATCCTTCG TTCAGGGGGT
ACTTCAGCGA TGATAAGTCT CAATAGTGTT GGGCTGGCAG GGGTTAAGAA GATTGGCGAA
ATAGGGGAGC TGGCTATTCA TGGCCACCGT AATGGCTGGG GTATGCTCAA CCGTCACCCT
TTACTGGGTA TAGAGTTTCC TGCCTATCAG CAGCTTTGGC GTTTGGCCGG GGTCGACCAG
ATCCATGTAA ATGGCATACA AAACAAATTC TGGGAATCTG ACGATTCTGT AGTGCGTTCT
ATTGAAGCCT GCTTCAAACC CTTATTGGGT GGCTATTCGG TTTTACCAGT GGTATCCTCG
GGGCAGTGGG GCGGGCAGGC TGTTGAAACC TACCGGCGCG TACCCTCTGT AGACTTGTTA
TATATGGCCG GAGGTGGAAT TATGGCGCAT CCAGACGGTC CTGCAGGTGG CGTAGTAGCT
TTACAACAGG CCTGGCAAGG TGCTGTAGAT GGCCTGTCAG TGGCTGAAAC AGCTGCAAAA
TATCCTGAAT TTGGACATTC GGTAAGTGTA TTCGGTAAAA AACAGGCCTA G
 
Protein sequence
MERITAKYYI ETPLDLEKSA QLLAGEQSSG TFIAVPGETE ELKQRFAARV ESITPMDTAN 
EPAIPGVLSA GGKYQRAMIE VSWSIENFGY NLPVMVSTLQ GNLYELTQFT GLKLMDLELP
ASFATAFKGP KFGIAGCRKL TGVYNRPLIG TIIKPSIGMT PEQTAALVNT LALAGIDFIK
DDELLGSSAN SPFDKRVDAI MEVINRHADR SGKKVMYAFN ISDDIDQMQR NYEKILRSGG
TSAMISLNSV GLAGVKKIGE IGELAIHGHR NGWGMLNRHP LLGIEFPAYQ QLWRLAGVDQ
IHVNGIQNKF WESDDSVVRS IEACFKPLLG GYSVLPVVSS GQWGGQAVET YRRVPSVDLL
YMAGGGIMAH PDGPAGGVVA LQQAWQGAVD GLSVAETAAK YPEFGHSVSV FGKKQA