Gene Phep_1709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1709 
Symbol 
ID8252811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2029402 
End bp2031123 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content43% 
IMG OID644935361 
ProductRagB/SusD domain protein 
Protein accessionYP_003091982 
Protein GI255531610 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.131317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00943188 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTAAGA TTATGAAAAA GATAGTTATA TGCTTTACAG CCTGCTTATA TTTAGCACTG 
GCCGCAGGTT GTAAAAAAAC ACTGGTGGAA GAACCACATT CTGATTTAAC TCCTGAATTT
TTTTCCACCG CTCAGGGTTT TCAAAAAGGA CTTGAAGCTG CTTATGCAGG AACCCGCTCC
TTCTGGGGAA ATCAAAACCT TTTTACGATG ACAGTAATAG GCACAGATGA ATTCTATACC
GGAAAGGATG GAAACAACAA CATCAACAAA TACAATAGCA ATTACGATAC GGCTAATGGA
ACCGTAGAGG CCATCTGGAA AGACTGTTAT ATGAACATTA ATACTTGTAA TGGTGTAATT
GATAATTCTG CTGCAGTTAC CGGATTGGAT GATAACCTTA AAAGCAGGAT TGTGGCCGAA
ACTAAATTCC TTCGGGCCAA TTACTATTTT ATCCTGGTAC AATTTTGGGG TGATGTTACC
TTAAACAAGA CTTTCCAGAG CACGCCCATC ACGTCGGCAA CGCGCACGCC AATGGCAGAA
GTATATGATT TTATTGTGAA AGACCTACAG GATGCCATTG CTACCCCAAG TTTTTATGCC
AATCCAAAAT CATCAGGGGC ACTACCCGGG GTAGCTACAA AAGCAGCTGC ACAACATTTG
CTGGCCAAAG TATATTTAAC CCGGGCAGGA TCTTCGGCAA AAAAAGCAAA TGATTATATA
GATGCTTACA ATATGGCCAA AACCGTAATT ACAACCAGCG GACTGTCATT GCTGCAGGAT
TTTGGTGATG TGTTTGCAGA AGGAAATGAA AACAGCAATG AAGTGATCTG GAATGTTCAG
CATACCTCTA ACCTGGCTTA TAATGGTCCA AATAACAGTG GAGGGGCTGA TAATGTATTG
AACCACATGT GGGTGCCGCA GTACGAACTG CGCCCTGGTA TGCAGCGTTC CGTAACTTAT
GGTCGTCCAT ATATCCGTTG CGTGCCTACT GCATGGCTAA CCAATGTGGC TTTCCAGGAA
AGGGTAAATG ACACCCGTTA CAACAAAACC TTTTTAACTA CCTGGATCAG TAACAATGCC
TCTTCTCTTC CTAAATGGGA AGCACCAATA CCACCGGGAG CTCCAGCTAA TGCTGCCGTT
GGGCAAGTCA AATTCACTGT TGGCGATACA GCTATATTTA TGCCGGGTTT TGATGTAAGT
GATGCTAAAA TTGCTGCTAC CAGGTATCTG CTGATCCCTC CACGTAAATA TGACATCACC
TTGTCGCCCT ATATGAAAAA GTATAACGAT ACCAAACGTG CTGATTTAAA TTATCCTTCT
ATCAGACCGG TGATCGTATA CCGCCTGGCA GAAACTTATT TAATTGCTGC AGAGGCTGCA
TTTATGGGTG GAGCTACCAT GACAGATGCC CTTGATAACA TCAATTTTGT AAGGAGAAGG
GCCGCCTATC CTAACGCCAA TCCGGCAGTA ATGAACGTGA CTACCATTCC TTCACTTGAT
TTTATCCTGG ATGAACGCAC AAGGGAGCTG TGTGGTGAAA ATGTAAGGTG GTGGGACCTG
GTGCGGACCA ATAAATTGAT TGATCGGGTA AAAACTAAAA ATTACAATCC TGAGGCAGCA
GCCAATATTA AGCCTTTCCA TGTTTTAAGG CCAATACCTC AAAAACAGAT TGACGGGGTT
ACCACAGGAC CTAAATATCC TCAAAATGAG GGTTGGTTTT AG
 
Protein sequence
MIKIMKKIVI CFTACLYLAL AAGCKKTLVE EPHSDLTPEF FSTAQGFQKG LEAAYAGTRS 
FWGNQNLFTM TVIGTDEFYT GKDGNNNINK YNSNYDTANG TVEAIWKDCY MNINTCNGVI
DNSAAVTGLD DNLKSRIVAE TKFLRANYYF ILVQFWGDVT LNKTFQSTPI TSATRTPMAE
VYDFIVKDLQ DAIATPSFYA NPKSSGALPG VATKAAAQHL LAKVYLTRAG SSAKKANDYI
DAYNMAKTVI TTSGLSLLQD FGDVFAEGNE NSNEVIWNVQ HTSNLAYNGP NNSGGADNVL
NHMWVPQYEL RPGMQRSVTY GRPYIRCVPT AWLTNVAFQE RVNDTRYNKT FLTTWISNNA
SSLPKWEAPI PPGAPANAAV GQVKFTVGDT AIFMPGFDVS DAKIAATRYL LIPPRKYDIT
LSPYMKKYND TKRADLNYPS IRPVIVYRLA ETYLIAAEAA FMGGATMTDA LDNINFVRRR
AAYPNANPAV MNVTTIPSLD FILDERTREL CGENVRWWDL VRTNKLIDRV KTKNYNPEAA
ANIKPFHVLR PIPQKQIDGV TTGPKYPQNE GWF