Gene Phep_2140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2140 
Symbol 
ID8253246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2467299 
End bp2468888 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content44% 
IMG OID644935789 
ProductRagB/SusD domain protein 
Protein accessionYP_003092406 
Protein GI255532034 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.305226 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATA TATCAATTTT ATTTTTGTTA AGCCTATTGC TGGTATCCTG TTATAAAGAT 
GCACTGGAAA CACAGCCAAA CGACAGGTAT ACAGAAGATA CTTACTGGAC TTCTGAAAAA
ACAGCTATGG CCGGTTTAAC CGGATGTTAC CAGGTACTGA CCTCCAATTC ATTATATGGA
TATGCCACTC CCTTATGGGA AGAAACGGCT ACGCCCAATG CCTATAATTA TGATAATTCT
GCTGGTTTTG GCCTGATTGC CCTGGGTACA CATACGGCTA CCAATGCCGG GGCCGGAAAT
ATTGTTTCTG GTGTAATCGA ATTCAGATGG AAAGACTGCT ACCGTGGGAT TGGCAGGTGT
AATACCCTTT TGGACCGGAT CAATGCTGTA CCCATGGCCG ATGTTTTAAA AGACAGGATG
AAAGCGGAAG CTAAGTTTTT AAGGGGACTG TATTATTCCA TACTGGCCAC TTATTATGGT
GGTGTACCGC TTATTTTAAC ACCCCCGAAT TTTGATCAGG ACGCAAAATT ACCACGAAAT
AGCCGGGCAG AAGTTGTTCA GCAGGTTGTA AAGGATATGG ACGAAGCTGC AATGGTACTG
CCACCCAAGT TTACAGGTAA CGATATAGGC CGTGCAACAA GCGGTGCAGC TTTGGCCATT
AAAGCAAGGA TGTTACTATT TGAGGCCAGT CCGCTTAATA ACCCTTCCGG CGATCTAACC
AAATGGGTTG CCGCCGCCAA TGCGGCAAAA GCCATAATAG ACCTTCCGGG AACAGGTTAT
GGCTTGTTTC CAAATTACAG ACAATTGTTT TTACCAGCCA ATGAGAACAA ACAGGAGACC
GTTTTTGATG TACAGTACAC AATTTCCACA ACAGGTTTTG GTAATTCTTT CGACCTCATT
AACCGGCTTT ACAATACCAA CGCACCCTTG CGAGACCTGA TCAATGCTTA TGACATGAAA
GATGGGCTTC CACCAGCCCA ATCGCCATTG TACGATGCTT TAAAGCCATA TGATAATCGT
GACCCGCGCA TGTACCAAAC CATAATTTAT CCGGGAGATA CCTATCTGGG GGCACCCGTT
ACTACCGCCA CTTTTAAACA AACCGGATAT GGGGTAAAAA AATATGGCAT ATATGATAAA
GAAGCGGTTG CAGCTGCCGA TCTGATCAAC AGCGCCGGAC GATCGCAGAT CAATTATATG
GTGGTACGTT ATGCGGATGT ACTGCTGATG TATGCCGAAG CACAAAATGA AGTACTCGGG
GCTCCTGACG TTACTGTACG AAATGCTGTT GAACTTGTAC GTCAGCGGGC AGGACTGGTG
CCTTATCAGG TATCAGCTAC GCTTACAAAA CCACAGATGC GCGAGCTAAT CAGGCATGAA
AGGAGAATAG AATTTGCATG TGAAGGTTTT TATTATACAG ATATCAGAAG ATGGAAAACG
GCTGAACAGG TGTTAACCGG CCCTATATTC AATTCGCAGA ACCAGCAAAT TGTTACCCGA
AATTTTAACC CATTGAGAGA TTACTGGTGG CCAATTGCGC AAACCCAGAG AGAGCTTAAT
CCAAACCTTG AACAGAATGA TAATTATTAA
 
Protein sequence
MKNISILFLL SLLLVSCYKD ALETQPNDRY TEDTYWTSEK TAMAGLTGCY QVLTSNSLYG 
YATPLWEETA TPNAYNYDNS AGFGLIALGT HTATNAGAGN IVSGVIEFRW KDCYRGIGRC
NTLLDRINAV PMADVLKDRM KAEAKFLRGL YYSILATYYG GVPLILTPPN FDQDAKLPRN
SRAEVVQQVV KDMDEAAMVL PPKFTGNDIG RATSGAALAI KARMLLFEAS PLNNPSGDLT
KWVAAANAAK AIIDLPGTGY GLFPNYRQLF LPANENKQET VFDVQYTIST TGFGNSFDLI
NRLYNTNAPL RDLINAYDMK DGLPPAQSPL YDALKPYDNR DPRMYQTIIY PGDTYLGAPV
TTATFKQTGY GVKKYGIYDK EAVAAADLIN SAGRSQINYM VVRYADVLLM YAEAQNEVLG
APDVTVRNAV ELVRQRAGLV PYQVSATLTK PQMRELIRHE RRIEFACEGF YYTDIRRWKT
AEQVLTGPIF NSQNQQIVTR NFNPLRDYWW PIAQTQRELN PNLEQNDNY