Gene Phep_1301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1301 
Symbol 
ID8252401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1548589 
End bp1550304 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content49% 
IMG OID644934955 
ProductRagB/SusD domain protein 
Protein accessionYP_003091578 
Protein GI255531206 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0026313 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA AAATATTTTT AGCTGCAATC TTTATGGTTC TAATGGCTGC AGCCTGTAAA 
AAAGGAGGGG TGCTGGAGCA GGTTAAAACC ACAGATTTAA CAGAAGAAAG TACCTTTGCA
GACAGTGCCC GCACTATGCA GTTCCTAACC AGGATTTATA CTGATATTGG CTTTAGTTCC
GATCCTAAAA GGTTTGGCAG CAGTGTAGGG GTATACAGCA TTTGTGATGA GGTGGAAGGC
TCGTTGCTCA GTGCTACTGC ATTTAACGTC ATTTTCCAGA CTGGAGCAAT CAGTGCATTA
AATGTGCCTA CTGATGCCTG GGTAACTACA TACGCCAACA TTAGAAGGGT AAACTTATTG
CTGAGCCATT TGCAGACCAC ACCGCTATCT CAGCGTTTAA GGGACAGGAT TGCCGGTGAG
GCCCGCTTTT TAAGGGCCTG GTATTATTTT ATCCTGATTA AACATTACGG GGGCGTGCCG
CTGGTAGGCG ATGTGGTTTA TGGCGCCACT GACCCGGTTT CAGGCAAGCG TGCCACTTAT
GAAGAATGTG TGAATTATAT TGAATCGGAA TGCGATGCTG CTGCCCTGGC CCTTCCACTC
GTACAAACCG GGCTCGATTT TGGACGCATT ACCAAAGGTG CAGCATTGGC ACTAAAATCC
AGGTTGCTGT TGTATGCTGC AAGCCCGCTG TTTAACGGCC GGGTAGATAT GGATGGGGTA
TTGGGTTATC CGAATGCCGA TCCTGCCCGA TGGAGCAAGG CTGCAAAAGC AGCGCTGGAT
GTGATCAGCC TGAACCAGTA CAGTCTTTAT GAGCTGGCTG GCGGTCTGGG CTTTCAGAAA
GTATTTACCC TGCGCAAAAA CAGTGAATAC ATACTGGCTT CCATGGCTGG TAATAACCGT
ACGCTGGAAG CCATCTGGGA TCCGGCGACC AGGACAGGGT CGGGCAGTGC CATGCCCTAC
CAGGAACTGG TAGATGCTTT TGGTACCATC AATGGCAAAG CGATTACGGA GGACCTTAAA
TCGCCTGGAA ACCCTACAGG TTATGATCCC ACAAATCCTT ATGTAAACCG CGATCCCCGT
TTCAACTGGA GCATCCTGTA CAATGAAGCC CCACGGTTGA ACACCAGTAA AACCGTTACA
CCGGTATTTA CTTACGCAGG TGCTGCGCAG GACGGTTTTA ACTTTACCAA AACCGGCTAT
TATTTAAGAA AAATGCTGGA CGACAATACC ATTGCCAGTA GCACCTCATC GGCAACAGAA
CGCTGCTTTC CTTTAATCCG CTATGCCGAG ATCCTGTTAA ATTATGCCGA AGCCAGTAAT
GAGGCAGGTG ATACACAAAC CGCATACACA CAGCTCAAGG CCATTCGCAA GCGCGCAGGC
ATACTGGCTG GTCCGGAAGA CGATTATGGA CTGGCGGAGG GGCTTACTAA AGAAGGGATG
CGGACGGTGA TCCAGAATGA AAGAAGGGTA GAGCTGGCTA TTGAAGAGCA TCGCTACTGG
GATGTACGCA GATGGAAGAT TGCAGAAAAT GTATCCAATA AAACCCTGCA CGGGATGAAA
ATCACCAGGC TAGGTACCGG TACACCTGCA ACCTATACTT ACGAACTGAT CAATATCCGT
ACACCGGCCT TTGTTGCACC GAAATATTAC CTGTGGCCAA TCCCGCAGGG CGAGGTCAAT
AAATCAGCAG AGCTGATACA AAACCCGGGC TGGTAA
 
Protein sequence
MKKKIFLAAI FMVLMAAACK KGGVLEQVKT TDLTEESTFA DSARTMQFLT RIYTDIGFSS 
DPKRFGSSVG VYSICDEVEG SLLSATAFNV IFQTGAISAL NVPTDAWVTT YANIRRVNLL
LSHLQTTPLS QRLRDRIAGE ARFLRAWYYF ILIKHYGGVP LVGDVVYGAT DPVSGKRATY
EECVNYIESE CDAAALALPL VQTGLDFGRI TKGAALALKS RLLLYAASPL FNGRVDMDGV
LGYPNADPAR WSKAAKAALD VISLNQYSLY ELAGGLGFQK VFTLRKNSEY ILASMAGNNR
TLEAIWDPAT RTGSGSAMPY QELVDAFGTI NGKAITEDLK SPGNPTGYDP TNPYVNRDPR
FNWSILYNEA PRLNTSKTVT PVFTYAGAAQ DGFNFTKTGY YLRKMLDDNT IASSTSSATE
RCFPLIRYAE ILLNYAEASN EAGDTQTAYT QLKAIRKRAG ILAGPEDDYG LAEGLTKEGM
RTVIQNERRV ELAIEEHRYW DVRRWKIAEN VSNKTLHGMK ITRLGTGTPA TYTYELINIR
TPAFVAPKYY LWPIPQGEVN KSAELIQNPG W