Gene Phep_3037 details


Gene Information

Locus tag: Phep_3037
Symbol:
ID: 8254149
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3628346
End bp: 3629560
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 41%
IMG OID: 644936686
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_003093297
Protein GI: 255532925
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
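
The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene information fields above.
start_bp = 3628346
end_bp = 3629560
gene_length_bp = 1215
protein_length_aa = 404

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span vs. stated gene length
print(span_bp % 3 == 0)                          # whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop vs. stated protein length
```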


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0138624
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTACTAA CGGATATAAG ACAGCAGTTC CCCATATTAT CGAGGATGGT GAAAGGTAAA 
CCACTGGTTT ATTTCGACAA TGCTGCTACC TCGCAAAAGC CGCAACAGGT AATTGATGCA
TTGACGCATT ATTATTCGTT TTACAATGCC AATATACACC GCGGCATACA TACGCTTGCT
GAAGAGGCGA CAATGGCTTA TGAAGCTACC CGTGAGGCTG TTAGGGATTT TGTTGGTGCA
GATGCTACTG AAGAGATCAT TTTTACCAAA GGAACAACAG AGGCCATAAA CCTGGTTGCT
TATACCTGGG GCAGACAAAA CATTACTGCA GGGGACGAGA TCATTATATC CGGCATGGAA
CATCATTCGA ATATTGTTCC CTGGCAAATA CTATGTGAAG AGAAAAAGGC TTTCCTGAAA
GTAATTCCTG TTACAGATGA GGGAGAACTT TCCATAGAAG CTTATAAAGA ATTACTGGGC
TCGAAAACAA AACTGGTAGC TGTTGTTCAT GTATCTAATT CGTTGGGTAC CATAAATCAT
GTGAACGAAA TTATCACTGC TGCACATCTT GTTGGTGCCA AAGTGCTGAT AGATGGGGCC
CAATCTGCAG TCCACCTGGA TATCGATGTT CAGAAAATGG ATTGTGATTT TTTTGCTTTT
TCCGGCCATA AGGTATATGG CCCTACAGGG GTTGGTGTAC TGTATGGTAA ACGCGAATTG
TTGGAGGATA TGCCTGTTTT TCAAGGTGGT GGGGAAATGA TCAAAGATGT TACATTTGAG
CAGACTACTT ATAATGACCT GCCTTATAAA TATGAAGCGG GTACACCAAA TATTGCAGAT
ACAATTGCTT TAAAAACAGC ATTGGATTTT ATTACTGCAG TTGGAAAAGA TAAGATCAGG
GTACATGAAG CTAATTTACT GGCCTACGCA ACAGCTCATT TAAAAACCAT TCCGGATTTG
AGCATCATTG GCGAAGCCAA AGACAAAGCG GGTCTGGTGT CTTTTGTTGT TAAAGGTATA
CATCCACAGG ATATTGGGGT ATTGCTCGAT AATATGGGTA TAGCTGTTAG AACAGGACAT
CATTGTACGC AACCATTGAT GAAACGCTTT GGTATCCCTG GTACGGTAAG GGCATCCTTT
GCAATGTATA ACCAACCGGA AGAAATAGAT GTGCTGATTA CCGGACTGCA CAAAACTATA
AAAATGCTAA CGTAA
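
The GC content field of a record like this can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper name, and the database's exact rounding rule is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks that group the bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return gc / len(bases)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGTACTAA CGGATATAAG ACAGCAGTTC CCCATATTAT CGAGGATGGT GAAAGGTAAA"
print(f"{gc_content(first_line):.2f}")
```

Passing the full gene sequence, with its spaces and line breaks left in place, gives the value for the whole gene.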
 
Protein sequence
MVLTDIRQQF PILSRMVKGK PLVYFDNAAT SQKPQQVIDA LTHYYSFYNA NIHRGIHTLA 
EEATMAYEAT REAVRDFVGA DATEEIIFTK GTTEAINLVA YTWGRQNITA GDEIIISGME
HHSNIVPWQI LCEEKKAFLK VIPVTDEGEL SIEAYKELLG SKTKLVAVVH VSNSLGTINH
VNEIITAAHL VGAKVLIDGA QSAVHLDIDV QKMDCDFFAF SGHKVYGPTG VGVLYGKREL
LEDMPVFQGG GEMIKDVTFE QTTYNDLPYK YEAGTPNIAD TIALKTALDF ITAVGKDKIR
VHEANLLAYA TAHLKTIPDL SIIGEAKDKA GLVSFVVKGI HPQDIGVLLD NMGIAVRTGH
HCTQPLMKRF GIPGTVRASF AMYNQPEEID VLITGLHKTI KMLT
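
The protein sequence is the translation of the gene sequence under translation table 11. Below is a minimal sketch, assuming the elongation codon assignments of table 11 match the standard genetic code (table 11 differs mainly in which codons may act as start codons, which does not matter here because the gene begins with ATG). `translate` and `CODON_TABLE` are illustrative names, not part of the record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks that group the bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGTACTAA CGGATATAAG ACAGCAGTTC"))  # prints MVLTDIRQQF
```

The output matches the first ten residues of the protein sequence, and translating the final line of the gene sequence (`AAAATGCTAA CGTAA`) yields `KMLT`, the last four residues.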