Gene Phep_3874 details


Gene Information

Locus tag: Phep_3874
Symbol:
ID: 8255008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4654484
End bp: 4656190
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 45%
IMG OID: 644937538
Product: RagB/SusD domain protein
Protein accession: YP_003094127
Protein GI: 255533755
COG category:
COG ID:
TIGRFAM ID:
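
The coordinates and lengths in this record can be checked against each other with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4654484
end_bp = 4656190
reported_gene_length = 1707      # bp
reported_protein_length = 568    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)           # 1707 True
print(protein_length, protein_length == reported_protein_length)  # 568 True
```

Both derived values agree with the reported ones, which is consistent with the inclusive-coordinate assumption.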


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.483678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTTA AAAGATTTAT GTTAGGGTTG ATGGCAATTC TAACCACCTT TATGGGCTGT 
AAGGATTACC TGGACAGGGA AATTCCAACT AATGTTAAGG ACGATCAGGT ATTTGTGAAC
TATGACCGCA TTTCACAGGC AGGATATGGG GCTTATGCCT TCCTTTTTAA TACCATTGGC
TATAACCGGA TCAATGGTGC CATGCTGGCC TCAGGTTGTG ATGAAGCCGA TCATGCAGAC
AATATTTCAA GTATTCAGCG GTTTAATACG GGCACCTGGA ATGCTACATT TAATCCTGAA
GATGTATGGG GACTGTTTTA CCAGGGCATC AGGCGGGCAA ATCTTTTTCT GGAAGAATCG
GCAGATTTTA AAAACCTGAT CTACAGAGAT ACCATTAATG TAACCAATAA AGACCTTTAT
AAATACAGAG TACGGGATTT AGAATGGTTA AGGGCCGAAA ACCGCTTTTT ACGGGCATAT
TACTATGCTG AGCTTATTAA ACGTTATGGG GGAGTACCTA TTCTGCTAAA ATCTGTAACG
GATATTGATG AACTGAATAC TTATAAGCGA AAAACTTATG AGGAATGTGT TCAATTTATT
TCCGATGAGT GTGATGCGGT AGTGCCCCTG CTGAACGAGA GCTGGGTAGG TTTTGATGGC
GACAAGTGGC GTGGCAGGGT AACCAAAGGA GCGGCAATGG CATTGAAAGC AAGGGTATTG
CTATATGCTG CCAGTCCGCT TAATAATGCA TCCAATGATA TTACCAAATG GCAAAAGGCC
GCAAAAGCAG CGCACGATGT AATTGCCCTG AATAAGTATG GCTTGCATAC TGACTATAGA
GGATTGTTTA GGTTGGGGAA TGGGGCTGAT GGAAACCCGG AGATCATTTT CGCACAGCAG
GGTTATAACA GAAACGATTA TGAAAAATAC AATTACCCTA TTGGCTATGA CCAGGGGGGG
TTAGGGAGCA CCTCTCCATC ACAGAATCTG GTAGATGCTT ATGAAATGAA AACTACGGGC
CTTGCTATAA CTGAAAATGG GTCAGGTTAT GATCCGGCCA ATCCATATGC GAACAGAGAT
CCACGCTTAG GGCTTAGCAT ACTGGTCAGC AATACTTCCT TTAAAGGACG CCCGGTGGAA
GCCTGGGTAG GTGGTTTGGA TGGGCTTGGT AAATTTAAGG CAACTACGAC CGGTTACTAT
ATCCGCAAAT ACGTGGACGA AAACCTGAAC CTGGCCCAGG GGGCGACCAG CTTGCATACC
TGGATGATTT TCCGGTATGC AGAGGTGCTG TTAAACTATG CCGAAGCGAT GAATGAAGCT
TATGGCCCCG ACGTTACGGC CGGTTATAGC ATGTCGGCAA AAAAAGCCGT AGATATGGTC
AGGGCCCGGA CGGGTATTGC TATGCCACCT CTTCCTCCCG GTCTTTCAGT TGATGAAATG
CGTTTACGCA TCAGAAACGA ACGACGGGTT GAGCTTGCAT TTGAAGAACA CCGTTTCTTT
GATGTCAGGA GATGGAAAAT TGCTGCACAA ACAGAGAATA GACCGGTAAT GGCCATGAAG
ATCACGAAAA ATACAAATGG AAGTTTTAGT TATCTGGTGG TTAAGGCGGA AGACAGGACA
TTTAGCGAAC GTATGTATTT ATACCCTATT CCCGAAGTTG AGGTGCTTAA AAGTAACGGA
AGTCTGGTCC AAAATCCGGG CTGGTAA
 
Protein sequence
MKLKRFMLGL MAILTTFMGC KDYLDREIPT NVKDDQVFVN YDRISQAGYG AYAFLFNTIG 
YNRINGAMLA SGCDEADHAD NISSIQRFNT GTWNATFNPE DVWGLFYQGI RRANLFLEES
ADFKNLIYRD TINVTNKDLY KYRVRDLEWL RAENRFLRAY YYAELIKRYG GVPILLKSVT
DIDELNTYKR KTYEECVQFI SDECDAVVPL LNESWVGFDG DKWRGRVTKG AAMALKARVL
LYAASPLNNA SNDITKWQKA AKAAHDVIAL NKYGLHTDYR GLFRLGNGAD GNPEIIFAQQ
GYNRNDYEKY NYPIGYDQGG LGSTSPSQNL VDAYEMKTTG LAITENGSGY DPANPYANRD
PRLGLSILVS NTSFKGRPVE AWVGGLDGLG KFKATTTGYY IRKYVDENLN LAQGATSLHT
WMIFRYAEVL LNYAEAMNEA YGPDVTAGYS MSAKKAVDMV RARTGIAMPP LPPGLSVDEM
RLRIRNERRV ELAFEEHRFF DVRRWKIAAQ TENRPVMAMK ITKNTNGSFS YLVVKAEDRT
FSERMYLYPI PEVEVLKSNG SLVQNPGW
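
Both sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be joined, translated, and checked against the protein sequence, using only the Python standard library. The function names (join_blocks, gc_fraction, translate) are hypothetical helpers, not part of any database tool. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in its permitted start codons; since this gene starts with ATG, a plain codon lookup is enough here.

```python
# Codon table built in the conventional T, C, A, G base order.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text):
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last display lines of the gene sequence, copied from the record.
first_line = join_blocks(
    "ATGAAACTTA AAAGATTTAT GTTAGGGTTG ATGGCAATTC TAACCACCTT TATGGGCTGT"
)
last_line = join_blocks("AGTCTGGTCC AAAATCCGGG CTGGTAA")

print(translate(first_line))  # MKLKRFMLGLMAILTTFMGC
print(translate(last_line))   # SLVQNPGW
```

The two outputs match the first 20 and the last 8 residues of the protein sequence. Passing the full joined gene sequence to gc_fraction would give a value to compare with the reported GC content.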