Gene Phep_3795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3795 
Symbol 
ID8254929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4553101 
End bp4554516 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content45% 
IMG OID644937459 
Productregulatory protein GntR HTH 
Protein accessionYP_003094048 
Protein GI255533676 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAAG AAGTATTGTA CCTGAAAATT GCCGGTACGG TTGAGCAGCA GATTGCTTCC 
GAAACCCTCA AGGTGGGCGA CAAATTGCCC TCTATCCGTA CCGTACAAAA GATTTACAAT
GTAAGCCTGA ATACCGTTAA ACAAGCATTT TTAGAACTGG AAAGCAAATC GCTGATCGAA
TCCAGGCCTA AATCGGGCTA TTATGTCAGC AATTCTTATG GCCGCAGGCT TTCCATTCCC
GATACCAGCA AGCCAAAGTT GTACGATAAA GAAAAACATC CTGAAGACCT GATCACCAGG
GTTTTTGATA CCCTGTACTA CAAAGACATT ACCAAGTTTT CCTTAGGTGT ACCTGCCAAC
GGCTTATTAC CTATTGCCAA ATTAAACAAA GGGGTAATCA AAGCCATGCG TAACCTGGAA
GGGAGTGGTA CGGCCTACGA ACCTGTACAG GGCAGTGTAA ACTTACGGCG GAACATATCC
CGCTGGTCTT TTGTTTGGGC CGGTCAGCTC ACCGAGAATG ATGTGGTAAC TACCTCGGGT
GCTATGAATG CCCTGTACAA TTGCTTTATT GCCGTTACCA AACCTGGTGA TACCATTGCC
CTGGAAAGCC CTATATATTT TGGGATATTA CAGATGGCGC GCGAGCTGGG CCTAAAAATT
ATTGAACTGC CTACTCATCC GGTTACCGGG ATTGAAATAG ATGCTTTGAA AAAAATCCTG
CCCGTTATTA AAGCCTGTTG CCTGGTCAGC AATTTCAACA ATCCTCTCGG CAGCTGTATG
CCCGATGAGC ATAAAAAAGA GGTGGTCAGG CTGCTTACCC AGCATAACAT ACCATTGATT
GAGGATGACC TTTATGGCGA TATTTTTTTC GGAACTACAC GTCCGGTACC CTGCAAATAC
TATGATGAGG CCGGAATGGT CATGTGGTGC GGTTCCTTTT CCAAGTCCCT GGCTCCGGGT
TACAGAGTAG GCTGGGTAGC TCCGGGGAAG TTCAAAGAAA AGATCATCAG GCAGAAATTG
TTACAAACCA TATCAACACC CCCGCTTTAC CAGGAAGTAA TTGCCGATTT TATGGAGCAT
GGGCGTTACG ACCATCACCT GCGGACTTTA AGACATAAGT TATATACCAA CTGCCTTCAA
TACCAGCGGG CAATAACAGA TTACTTTCCC GAAAACACAA AGGTAACACA ACCCCAGGGA
GGCTTTGTAT TGTGGCTGGA GCTCGATAAG AGAATAGATA CCGCAATGCT CTATAACCAG
GCCATCAAAC AGAACATCAG TTTTGCGCCC GGCCGCATGT TTACCCAACA CAATCAGTTT
AACAATTGCA TGCGCTTAAA TTTTGGGCTC AAATGGGATG AAAAACTGGA GTCTGACTTA
AAACGGCTGG GGAAGATTGT GAAGAACGCT TTATAG
 
Protein sequence
MSKEVLYLKI AGTVEQQIAS ETLKVGDKLP SIRTVQKIYN VSLNTVKQAF LELESKSLIE 
SRPKSGYYVS NSYGRRLSIP DTSKPKLYDK EKHPEDLITR VFDTLYYKDI TKFSLGVPAN
GLLPIAKLNK GVIKAMRNLE GSGTAYEPVQ GSVNLRRNIS RWSFVWAGQL TENDVVTTSG
AMNALYNCFI AVTKPGDTIA LESPIYFGIL QMARELGLKI IELPTHPVTG IEIDALKKIL
PVIKACCLVS NFNNPLGSCM PDEHKKEVVR LLTQHNIPLI EDDLYGDIFF GTTRPVPCKY
YDEAGMVMWC GSFSKSLAPG YRVGWVAPGK FKEKIIRQKL LQTISTPPLY QEVIADFMEH
GRYDHHLRTL RHKLYTNCLQ YQRAITDYFP ENTKVTQPQG GFVLWLELDK RIDTAMLYNQ
AIKQNISFAP GRMFTQHNQF NNCMRLNFGL KWDEKLESDL KRLGKIVKNA L