Gene Phep_4053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4053 
Symbol 
ID8255187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4897446 
End bp4898549 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content48% 
IMG OID644937717 
Productcarboxylate-amine ligase 
Protein accessionYP_003094306 
Protein GI255533934 
COG category[S] Function unknown 
COG ID[COG2170] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02050] uncharacterized enzyme 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00732082 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACG ATTTTACCCT CGGTATAGAG GAAGAATATA TGGTAACAGA CCCTGTGACC 
AGGGAACTGA CCTCGCACGA CCAGAAGATA GTTGAAGCGG CACAAAAGAT ACATAAAGAC
CAGGTAAAGG CCGAGATGCA CCAGGCGGTG GTAGAAGTGG GTACCGGAAT TTGCAGGAAT
ACCGACCAGG CAAGGGCGGA AATTTCGCAG CTGCGCTATA CGGTTTCGCA ACTGGCCGGC
GAGCTGGGAC TTAGGATTGG TGCTGCCGGT ACACATCCTT TTTCCCATTG GGAAAAACAG
CTGATTACCG AACATCCCCG TTACAGCGAC ATTGTGAACG AGCTGCAGGA GGCCGCGCGC
TCTAACCTGA TTTTTGGACT GCATGTACAT GTGGGCTTCC AGTCGCGCGA GCTGGCCATA
CACATTGCTA ACCAGGTGCG CTATTTTTTA CCGCATGTTT TTGCCCTTTC AACCAATTCC
CCCTTCTGGG AAGGGAGAAA TACAGGGTAC AAATCGTTCC GCACCAAGGT TTTTGACAAA
TTTCCGCGAA CAGGCATTCC CGATATTTTT AACAGCATTG AAGATTATGA CAATTATGTA
AAGCTGCTGA TCAAGACCAA CAGCATTGAC AATGCCAAAA AAATCTGGTG GGACATCAGG
GTGCATCCTT TTTTTGAAAC CATAGAATTC AGGATCTGTG ATTGCCCCAT GCTGATCGAT
GAAACCATGG CCTTTGTTGC CTTGTTTCAG TCCTTGTGCG CAAAACTGTA CAAGCTGCGC
CTGCAAAACA TGAAGTTCAT CAGCTATTCC AGGGCACTGA TCAATGAGAA TAAATGGCGG
GCCGCACGTT ATGGAATTGA TGGTAACCTG ATTGATTTTG GGAAAGAAAT GGAGGTAAAC
TGTCGCAACC TGGTACTGGA GCTACTGGAT TTTGTGGACG ATGTAGTGGA CGACCTGGGT
TGCCGCAGGG AGATCAATTA TGTAAGCCAG ATACTGGCCA ACGGAACTGG TGCCGACAGG
CAATTGGCTG TTTACGAACA ATTTGGTAAC TTTGAGGCAG TGGTAGATTA CATTACCACG
CAAACTTTAA TTGGGGCTAA ATAG
 
Protein sequence
MMNDFTLGIE EEYMVTDPVT RELTSHDQKI VEAAQKIHKD QVKAEMHQAV VEVGTGICRN 
TDQARAEISQ LRYTVSQLAG ELGLRIGAAG THPFSHWEKQ LITEHPRYSD IVNELQEAAR
SNLIFGLHVH VGFQSRELAI HIANQVRYFL PHVFALSTNS PFWEGRNTGY KSFRTKVFDK
FPRTGIPDIF NSIEDYDNYV KLLIKTNSID NAKKIWWDIR VHPFFETIEF RICDCPMLID
ETMAFVALFQ SLCAKLYKLR LQNMKFISYS RALINENKWR AARYGIDGNL IDFGKEMEVN
CRNLVLELLD FVDDVVDDLG CRREINYVSQ ILANGTGADR QLAVYEQFGN FEAVVDYITT
QTLIGAK