Gene Phep_1801 details

Gene Information

Locus tag: Phep_1801
Symbol:
ID: 8252904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2097590
End bp: 2099044
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 42%
IMG OID: 644935452
Product: RagB/SusD domain protein
Protein accession: YP_003092072
Protein GI: 255531700
COG category:
COG ID:
TIGRFAM ID:
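The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2097590
end_bp = 2099044
gene_length_bp = 1455
protein_length_aa = 484


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1455
print(expected_protein_length(gene_length_bp))  # 484
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.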


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.587569
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000152838
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTAACTA TCACCAAAAT ATGCAGTAAA TATTTTATGA TTTTATCCAT CATATTTTAT 
ATATCTAGTT TAGGATGTAA GAAGTTTGTG GACATCGATC CTCCTGTCAC AGACCTGACC
ACACTTAGTG CTTTTAAGGA GGATAATACC GCTATAGCTG TAATGACAGG GCATTTTGCT
AGTTTAAGCA ACAACGACCT TGCTTTTACA AATTTTACGG GAATCAGTAG ATTTACCGGC
CTTTCATCAG ATGAATTGAA GTTGTGGGAG GGAGTTTCCG ATGCTAAACA GATCGGCTAC
TTTACCAATG ATTTATTTTC TCATCAGGGC GCTTCAGCAG GGGATGAATA TTGGCCGTAT
ACAGAGATAT ATGCCTGCAA TACCATTATA GAAGGGGTGA GTAATTCTGA TGGTTTGAGT
GCTGCTGTAA AAAATCAATT GCTGGGAGAA GCGAAGTTTA TGAGGGCGTC CTATTTTTTA
AATTTAGCAA ATCTATATGG CGATATCCCA ATGCCTACTT CAACTGTTGT TAAGGTAAAT
ATCCAATTGA GCAGGACTCC CAAAAAACAG GTTTATGAAC AGATTATTAA GGATCTGAAA
GAAGCGGAGG CCTTACTAAG CCCTGAATAT TTAAATGGGG GACTAAAGAA ATATTCATCT
GCTCCGGAAA GGGTACGGCC CACCAGCTGG GCTGCTTCAG CTTTACTGGC TAGGGTTTAC
CTATACAATG AGGAATGGAG TAACGCAGAA CTAACCTCTT CAAAATTAAT AAGTAACAAC
GGTTTATTTG GTTTGGAGCC CTTAAACGAT GTATTTAAGA AAGATAGCCG GGAAGCCATT
TGGCAGCTCC AGACTGTGTA TACTGGTATG AATACCGGTG ACGGAGGCTT TTTTATCCTG
AGTAGTTTTG GACTATCCGA ATATAACCCC GTATACTTAA GTAGCTTTAT GTTAAATGCG
TTTGAACCGG GGGATTCCAG GGCTGTTCCT GGAAACTGGG TTGACAAAAC CGTGATTTCA
GGGACTACTT ATTATTACCC CTTTAAATAC AAATTGAGTT ACGGTTCTAC GCAAGGCGCT
GAATATATCA TGATGCTTCG GTTAGGAGAG CAATATCTGA TCAGGGCAGA AGCAAGAGCA
AAATTGGAAA ATGTTAACGG CGCAAGAGAA GATCTGTTTG TCATCAGAAG AAGGGCAGGT
CTGGCTGATG CCACCTTAAC GGCCAATGAC CAGAATTCCT TACTGACCGC TGTCATGCAT
GAAAGGCAAG TGGAACTTTT TACTGAATGG GGACACCGAT GGTTTGACTT GAAACGTACC
GGTAAGGTAG ATGCCGTAAT GAGTGTGGTT ACCCCAACAA AAGGAGGAAC ATGGCAAAGT
ACAGATCAGC TCTACCCGCT GCCGTTCAGA GATCTTCAAA GGGACAGGAA TTTAACGCAA
AACCCTGGGT ATTGA
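
The GC content field can be recomputed from a sequence laid out in space-separated blocks of ten bases, as the gene sequence is here. This is a minimal sketch; `gc_content` is a hypothetical helper, and the record does not state how the database rounds its reported percentage.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence above: 5 of 20 bases are G or C.
print(gc_content("ATGTTAACTA TCACCAAAAT"))  # 25.0
```

Passing the full gene sequence as a single string would give the value to compare with the GC content field.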
 
Protein sequence
MLTITKICSK YFMILSIIFY ISSLGCKKFV DIDPPVTDLT TLSAFKEDNT AIAVMTGHFA 
SLSNNDLAFT NFTGISRFTG LSSDELKLWE GVSDAKQIGY FTNDLFSHQG ASAGDEYWPY
TEIYACNTII EGVSNSDGLS AAVKNQLLGE AKFMRASYFL NLANLYGDIP MPTSTVVKVN
IQLSRTPKKQ VYEQIIKDLK EAEALLSPEY LNGGLKKYSS APERVRPTSW AASALLARVY
LYNEEWSNAE LTSSKLISNN GLFGLEPLND VFKKDSREAI WQLQTVYTGM NTGDGGFFIL
SSFGLSEYNP VYLSSFMLNA FEPGDSRAVP GNWVDKTVIS GTTYYYPFKY KLSYGSTQGA
EYIMMLRLGE QYLIRAEARA KLENVNGARE DLFVIRRRAG LADATLTAND QNSLLTAVMH
ERQVELFTEW GHRWFDLKRT GKVDAVMSVV TPTKGGTWQS TDQLYPLPFR DLQRDRNLTQ
NPGY
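
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the standard codon assignments, which translation table 11 shares for internal codons; it does not implement the alternative start codons that table 11 permits, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
# Standard codon table in NCBI order (first, second, third base each cycle T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases (for example N) are not handled
    and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGTTAACTA TCACCAAAAT ATGCAGTAAA"))  # MLTITKICSK
print(translate("AACCCTGGGT ATTGA"))                  # NPGY
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above.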