Gene Phep_1967 details


Gene Information

Locus tag: Phep_1967
Symbol:
ID: 8253071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2269971
End bp: 2271647
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 38%
IMG OID: 644935618
Product: hypothetical protein
Protein accession: YP_003092237
Protein GI: 255531865
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
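
The start, end, gene length, and protein length fields in this record agree with one another, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a complete CDS whose final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates copied from the record above.
length_bp = gene_length(2269971, 2271647)
print(length_bp)                  # 1677
print(protein_length(length_bp))  # 558
```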


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.929106
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000146847
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGAATT TAAATTTTTA TGATGCGATT GTAATTGGGT CTGGTATCAG CGGGGGGTGG 
GCAGCAATGG AATTATGTAA AAAGGGATTG AAAACGTTAC TTCTTGAACG TGGAAGAGAT
GTCAAACATA TACAAGATTA TCCCACAGCT AATTTGAACC CGTGGGATTT TGAGTTGGGC
TTTAATAATA CACTAAAAGA TCAGGAGAAT GATCCCATAC AAAGTATGGC ATATACTCCT
GCAGACAAGC ATTTTTATGT GAGTGATAAA GACCAGCCTT ATGTTCAGGA AAAGCCATTT
AATTGGTTTC GCGGCTATCA AGTAGGCGGC AGGTCGTTAC TATGGGGGCG CCAATGTTAT
CGTCTTAGCG ATTTAGATTT TGAGGCAAAT TTGAAAGAAG GCGTAGCCAT AGATTGGCCA
ATAAGATACA AAGACATTGC CAGCTGGTAT AGTTACGTGG AATCATTTAT CGGCGTTAGC
GGAAAACATG AAAATCTATC ACAGTTGCCC GATGGAGAGT TTTTGCCGCC ATTCGAATTG
AATTGCATAG AGGAACATTT AGCTGATTCT ATCCTCAAAA CTGAAGAAAA CAGGTTACTT
ACCCCAGCGC GCGTAGCCAA CCTAACGAAA GGCTGGGATA ATAGAGGACC ATGCCAAAAT
AGAAATCTTT GTACCAGGGG CTGTCCATTT GGTGGTTATT TTAGCAGTAA CAGTTCTACA
ATCCCTTCTG CAATGGCTAC AGGTAATTTA ACATTGAGGC CTTTTTCAAT CGTTGTTGAG
TTAATATATG ATGAACAAGA ACAGAAAGCA AAAGGAGTTA AAGTAATTGA TAGCGTAAGT
AATGAAGTTC ACCTATTTTA TGCAAACATC ATCTTTATCA ATGCATCTAC TATACCAACT
ACAGCATTAT TACTTAACTC CGTTTCCTCA AGGTTTCCAA ATGGGTTTGG AAATGATAGT
GGCCAGGTAG GACATAACCT AATGGATCAT CATTCTTCTG CGGGAGCATT TGGAATGCAT
GATAGTTTTA AAAATCAGTA TTATAAAGGT AGACGGCCTT GTGGATTTTT AATTCCAAGA
TATAGAAACC TGAATAATAA TGAAAATCTT GGTTTTAGCA GAGGGTACAA CATCCAGGGT
CGGGGCCAGC GTCAGGAGTG GGTAGATCTT TCTTCTTCAA ATGGATATGG AAGTCAGTTT
AAAAAAGAAA TCACAACACC TGGTAAGTGG ATGGTATGGA TGGCCGGATG GGGAGAGTGT
TTGCCATATT TTGAAAATCG AGTCAGTCTA GTTCCCGATA AAGTAGATAA ATGGGGGCAA
AAGTTAATAG CTATTGATTT TGAATTTAAG GATAATGAAA GAAAAATGAT GGATGATATC
AAAGACACGG CTACTGAAAT GCTTATAAAA GCTGGTTTTA ATAATATTGA TAGTTTTAAT
TACAATAAAC CAGGAGGTTC CACTGTACAT GAAATGGGCA CGGCAAGAAT GGGGAACGAT
CCAAAAACGT CGGTATTGAA TAGATTTAAC CAAATGCATT CTGTAAAAAA TGTTTTTATA
ACTGATGGAA GCTGTATGAC CTCTTCGGGA TGTCAAAATC CTTCATTAAC TTATATGGCT
TTAACTGCCA GAGCTTGCGA TTACGCTGTT AAGCAATTAA AGCTGGGTAA TCTTTGA
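
The GC content reported in the gene information is the percentage of G and C bases in a sequence like the one above. A minimal sketch of that calculation follows, assuming the spaces between 10-base blocks are layout only and the sequence contains just A, C, G, and T. `gc_content` is a hypothetical helper, and the example input is only the first row of the gene sequence, not the whole gene.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, used here only as sample input.
first_row = "ATGAAGAATT TAAATTTTTA TGATGCGATT GTAATTGGGT CTGGTATCAG CGGGGGGTGG"
print(f"{gc_content(first_row):.1f}%")
```

Running the same function over all rows of the gene sequence, joined together, gives the whole-gene figure to compare with the reported value.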
 
Protein sequence
MKNLNFYDAI VIGSGISGGW AAMELCKKGL KTLLLERGRD VKHIQDYPTA NLNPWDFELG 
FNNTLKDQEN DPIQSMAYTP ADKHFYVSDK DQPYVQEKPF NWFRGYQVGG RSLLWGRQCY
RLSDLDFEAN LKEGVAIDWP IRYKDIASWY SYVESFIGVS GKHENLSQLP DGEFLPPFEL
NCIEEHLADS ILKTEENRLL TPARVANLTK GWDNRGPCQN RNLCTRGCPF GGYFSSNSST
IPSAMATGNL TLRPFSIVVE LIYDEQEQKA KGVKVIDSVS NEVHLFYANI IFINASTIPT
TALLLNSVSS RFPNGFGNDS GQVGHNLMDH HSSAGAFGMH DSFKNQYYKG RRPCGFLIPR
YRNLNNNENL GFSRGYNIQG RGQRQEWVDL SSSNGYGSQF KKEITTPGKW MVWMAGWGEC
LPYFENRVSL VPDKVDKWGQ KLIAIDFEFK DNERKMMDDI KDTATEMLIK AGFNNIDSFN
YNKPGGSTVH EMGTARMGND PKTSVLNRFN QMHSVKNVFI TDGSCMTSSG CQNPSLTYMA
LTARACDYAV KQLKLGNL
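
The protein sequence can be reproduced from the gene sequence by codon-wise translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which start codons are permitted, so the sketch below hard-codes the standard assignments. It assumes the input is in frame and on the coding strand. `translate` and the constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS up to, but not including, the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence; the result matches the first 20 residues
# of the protein sequence in this record.
first_row = "ATGAAGAATT TAAATTTTTA TGATGCGATT GTAATTGGGT CTGGTATCAG CGGGGGGTGG"
print(translate(first_row))  # MKNLNFYDAIVIGSGISGGW
```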