Gene Phep_1016 details

Gene Information

Locus tag: Phep_1016
Symbol:
ID: 8252110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1189255
End bp: 1191039
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 46%
IMG OID: 644934670
Product: RagB/SusD domain protein
Protein accession: YP_003091299
Protein GI: 255530927
COG category:
COG ID:
TIGRFAM ID:
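
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1189255, 1191039)
print(length, protein_length_aa(length))  # prints: 1785 594
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.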


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00288117
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAAAG ACATGAAAAA GATTAAGAAC GTTTTTGTTG CCATTGTATC GGTAACCTGT 
TTAATAAGCT GCTCAAAGGA ACGGCTGGTA GAAAGTCCGC CAAACTTTAT AACTGCCGAT
GTATTATATA CCACTGTTGA AGGTTTTGAT GCCGGCCTTA ATGGCTTATA TGCGTCGGTA
AGGGATGAAA GAAATGGATT GAATTATACC AGCGGATTTG GCACCATAGA TCTGAGGGCA
GCCATCATGA TTACCGGTAC AGATAATATT GGTGCCGGAA GTAATACCGG GGGATTGTCG
ACCATTACTG CCGACTGGAC CAAAAATACG CCGGCCGACC CCAATATTGA CAAATTGTTT
TTATGGTTGT ATAAAGTAGT GAGTGCGGCC AATACCATCA TTACCAGAGC AGAAAATCCG
GTGATAAAAT GGGGTGCCGG TAATAAAGAA AGGGTATTGG GCGAAGCCAG GACGGTAAGG
GCCTGGGCAT ACCGTCACCT TACCTACTTA TGGGGCGATG TACCTTTATT AACGGAAGAG
GTATCAGGCG AGAATATCAG AACAGACCTT GTACGTGAAA AAGTAGCTAC CATCAGAAGG
GTGATGATTG AAGATTTTAA ATATGGAGCC GAAAACCTTC CCTGGTCGCC CACCAAAGCA
GGCCGCTTAA CCAGGGGAGT GGCGCAAACC TACCTTTCAG AAATCTACCT TGCTGTGGGA
AAACCGGATT CTGCATTGTT CTGGGCAAAT GAATGTATCA CCAAAGGACC TTATAGTTTG
GTTAAGACCA GGTATGGTGC TGGTTCCGAC CAGCCTGGTG TAGCTTTTAT GGATATGTTC
AATCCGGGCA AAACAAATAT TTCAGATGGA AATACGGAAG CCCTTTGGGT AATGCAGTGG
GAAAGGAACG CGATTGGGGG AGGTGAGAAC CTGATGCGCC ATGAAACAAC AATGAGGTAT
CCGAATGCGA GGTATGTGAA CCGTGTAGGT TTTCTTACGG CAACAGATGC CCGCGGTGGC
CGGGGCTGGA GCCGTCAGAC CATTACCAAA CAGGCTTTAT TGTGGTACAC CGCTTCGAGT
GACGTTCCGG CAGGTAAGGT TGACCAGAGA AGTTCTGAAT TTGCACTGCG TAAATATTTT
ATCCTGGGGG CCGATGATAA TTTTGCAGGT CTGACCAATA CGGCGACCAA AGCACCATGG
AAATTAGGGG ATACCGTATG GCTGGCAACC GGGATATCAA GAAGTATAAT TGCTGATCCT
GGTATGAAAT CTAAAAGTGG TGCGGTTAAT TTTTCCTTAC TGCCGGATGG TACTGCCAAT
AACAACGACT GGCCATACAG TTTGAAATTT GCTTATAATG ATGCCGGCTT TGCCAATACA
ACAGAGTCTC ACCAGGACCA GATCTATATG CGTCTGGCAG AGACCATTTT GTTAAGGGCA
GAGGCTAAAG GCCGTTTGGG TAACTTACCT GGAGCTGCAG ATGACATTAA CCTGCTGCGT
GACCGTGCGA ATGCCAAACG GGTTATTGTG TCCAACTTTG GTGCAACATT AACTACTTTC
CTGGATTATA TCCTGGACGA GCGCTCCCGG GAATTGCTGG TGGAAGAACA CCGCAGGTAT
ACCTTATTGC GTATGGGTGG GGCCAGTTTT TTCTACAGGA GAACCAATAC CTTCAATACC
ATCAGTAAAA ACCTTGCACT AAGGGATACA TTGTTGCCTA TTCCGCAATC GGTAATTGAT
GCAAACCTGA CTTTAAAAAT GCCTCAAAAT CCTGGATTTA ACTAA
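
The gene sequence above is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence can be analyzed. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the example runs only on the first line of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTTAAAAG ACATGAAAAA GATTAAGAAC GTTTTTGTTG CCATTGTATC GGTAACCTGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the complete gene sequence to `clean_sequence` should give a string of 1785 bases if the record is internally consistent.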
 
Protein sequence
MLKDMKKIKN VFVAIVSVTC LISCSKERLV ESPPNFITAD VLYTTVEGFD AGLNGLYASV 
RDERNGLNYT SGFGTIDLRA AIMITGTDNI GAGSNTGGLS TITADWTKNT PADPNIDKLF
LWLYKVVSAA NTIITRAENP VIKWGAGNKE RVLGEARTVR AWAYRHLTYL WGDVPLLTEE
VSGENIRTDL VREKVATIRR VMIEDFKYGA ENLPWSPTKA GRLTRGVAQT YLSEIYLAVG
KPDSALFWAN ECITKGPYSL VKTRYGAGSD QPGVAFMDMF NPGKTNISDG NTEALWVMQW
ERNAIGGGEN LMRHETTMRY PNARYVNRVG FLTATDARGG RGWSRQTITK QALLWYTASS
DVPAGKVDQR SSEFALRKYF ILGADDNFAG LTNTATKAPW KLGDTVWLAT GISRSIIADP
GMKSKSGAVN FSLLPDGTAN NNDWPYSLKF AYNDAGFANT TESHQDQIYM RLAETILLRA
EAKGRLGNLP GAADDINLLR DRANAKRVIV SNFGATLTTF LDYILDERSR ELLVEEHRRY
TLLRMGGASF FYRRTNTFNT ISKNLALRDT LLPIPQSVID ANLTLKMPQN PGFN
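
The protein sequence above corresponds to the gene sequence translated with translation table 11. As a minimal sketch, the function below translates a DNA string codon by codon. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted), so the standard 64-letter amino acid string is used. The function name is hypothetical, and ambiguous bases are not handled.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last six codons of the gene sequence in the record.
print(translate("ATGTTAAAAG ACATGAAAAA G"))  # prints: MLKDMKK
print(translate("AATCCTGGATTTAACTAA"))       # prints: NPGFN
```

The two outputs match the beginning (MLKDMKK) and the end (NPGFN) of the protein sequence listed in the record.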