Gene Phep_3980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3980 
Symbol 
ID8255114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4799346 
End bp4801016 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content43% 
IMG OID644937644 
ProductRagB/SusD domain protein 
Protein accessionYP_003094233 
Protein GI255533861 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.607109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATT ATAAACAGAT GAAAAATATA TATGTATCCA TAATTGGCAG TATAATGCTC 
TTTTCCTCCT GCCAGAAATT ATTAGATGAA GATGTTAGAA ATCAAATCTC CAATCTTTAT
TATACCACAC CCAGCGGCGT GGAAGACGGG GTAAAGGCCA GCTACAGCTA TCTGCGAAAC
TGGTATGGCA AGCAAGGGTC AGCCTGGTTA ACGATGTATG GGGATGAATA TACCAATGGA
GCTGCCGATG CCACAATGAA TGGTTATACT TCTGGTCTGA ACTCATCTTA TTTAGTGGTC
AGAGATGGCT GGAACCAAAC CTATACGGCC ATCAATACCT GTAATGCAGT TTTGGCTGCG
GCAGAAAAAG TGAATATGAC TGAGCAATTA AAAAACACCA GAATCGGTGA ACTCAAATTT
TTGCGGGCTC ATTATTTCTT TCTCCTCGTT CAAACCTATG GGGCGATTCC ACTGCCTTTA
ACACCAACTA CAAGTGCTTC GAGTGTGGCC ACCAGGACAC CGATTGCTGA AGTTTACAAA
GCAATTGTTG ATGACCTGAT TTTTGCTGTG GCCAATTTAC CAGAAACTAC CCCTGACTAT
GGCCGGGCAA CTTCACTTGC TGCCAAGCAT GCTTTAGCTA AAGTGTATTT AACCCGGGCC
GGTACAGCAG TTAAAGAAGC TACAGATTAT GCCAAAGCGG CAACTCTGGC AAAAGAGGTA
ATTGCATCTG GCAAATATTC TTTATTACCC GATTTTGCAG CAATATTTGC ACAAGGGCCC
GGAGGAAGGA ACAACGAGGT TATATTCGCC TGTCAATATG ATGTGAACCA GCTCACCGGT
GGTACAGCTG GAAATCTTGG AAATCAGACA CATCTCTTTT TTACCACAGA TTATACTTCC
CAGCCGGGCA TGGTACGCGA TGTAGCTAAC GGGCGGCCCT ACAATCATTT CAGGCCAACC
AATTTTATGT TGGGACTGTA CAACAAACAA TATGACTCCC GTTTTAGCAA GTCGTTTAAA
ACGGTATGGT ATTGTAACAA GCCAGGAAGT TATACCATCA GTGGTAAAAC TGTAGCTATG
GCATTGGGTG ATACTGCTGT GGTGATGACA GATTACGAAC CTACCCAGGC ACAAAGGAAT
GCCGCAAAAT ATACGATCAT CTCACCCAGT CAGCAACACA ATACCCTGTT TCCCCAATCT
TCAAAACACA TAGACGGATT AAGGGCCGAT GTTAACAATA TAAATGGTGT AAAAGATGTG
CTTATATTTA GATTAGGTGA AACTTATCTT ATTGCTGCTG AAGCATTACT GATGGACGGC
AAGGCTGGAG ATGCGGTGTT CTATGTGAAT GAACTGAGAA AAAGAGCTGC CATTGTTTCT
TCCGATCCTG CTGTAACAAC CGCTAACAGG CAGGCCATGG AGGTAAGTGC TTCGGACCTT
AACATAGATT TCATTTTGAA TGAAAGGGCC AGGGAGCTTA ACGGGGAGTA TATGAGGTGG
TTTGATCTGG TAAGAACAGG CAAATTATTG GAACGCGTGA AACTATACAA TACACTTGCT
GCACCTAATA TTAAATCCTA TCATGTGTTG AGGCCTGTTC CACAAACCCA GATAGACAGG
GTTATTGGTG GGGCAAGTGC TTTTCCTCAA AACCCAGGAT ACGACAATTA G
 
Protein sequence
MNNYKQMKNI YVSIIGSIML FSSCQKLLDE DVRNQISNLY YTTPSGVEDG VKASYSYLRN 
WYGKQGSAWL TMYGDEYTNG AADATMNGYT SGLNSSYLVV RDGWNQTYTA INTCNAVLAA
AEKVNMTEQL KNTRIGELKF LRAHYFFLLV QTYGAIPLPL TPTTSASSVA TRTPIAEVYK
AIVDDLIFAV ANLPETTPDY GRATSLAAKH ALAKVYLTRA GTAVKEATDY AKAATLAKEV
IASGKYSLLP DFAAIFAQGP GGRNNEVIFA CQYDVNQLTG GTAGNLGNQT HLFFTTDYTS
QPGMVRDVAN GRPYNHFRPT NFMLGLYNKQ YDSRFSKSFK TVWYCNKPGS YTISGKTVAM
ALGDTAVVMT DYEPTQAQRN AAKYTIISPS QQHNTLFPQS SKHIDGLRAD VNNINGVKDV
LIFRLGETYL IAAEALLMDG KAGDAVFYVN ELRKRAAIVS SDPAVTTANR QAMEVSASDL
NIDFILNERA RELNGEYMRW FDLVRTGKLL ERVKLYNTLA APNIKSYHVL RPVPQTQIDR
VIGGASAFPQ NPGYDN