Gene Phep_4202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4202 
Symbol 
ID8255338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5080712 
End bp5082043 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content44% 
IMG OID644937868 
Productargininosuccinate lyase 
Protein accessionYP_003094455 
Protein GI255534083 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.088708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCT GGCAAAAAAA CATTGATGTA AACAAGGATA TTGAAGCTTT TACAGTAGGC 
AAAGACAGGG AACTGGATTT ACAGATGGCG GCTTATGATG TTCTGGGCTC ACTGGCGCAT
GTAGAAATGC TGGAAAGTAT TGGCTTACTT ACTGCAGCAG AACTTGCTGA AATACAGAAA
GCGCTTAAAA ATATATATGC AGAAATAGAG GCGGGTAAAT TTGTTATTGA AGATACAGTA
GAGGATGTGC ATTCGCAGGT AGAGTGGCTG TTGACCCAGC GTATAGGTGA TGCCGGTAAA
AAAATCCATA GCGGACGTTC GCGTAACGAC CAGGTGCTGG TTGACCTGAA ATTGTATTTC
AGAAGCTGTA TTGAAGAGAT GGTGGGCAAT ACTGCTGTGT TATTTGCACA GCTGATTGAG
CTGAGCAATA CCCATGCAGC TAAATTGTTG CCGGGGTATA CCCATTTGCA GATTGCAATG
CCTTCGTCCT TTGGTCTGTG GTTTGGTGCC TATGCAGAAA GTTTGGTTGA CGATATGGAG
CTGATGCTGG CAGCCTGGAA GGTATGCAAT AAAAATCCAT TGGGTTCGGC TGCGGGTTAC
GGTTCTTCTT TTCCTTTAAA CAGGACGATG ACTACTGAAC TGCTGGGTTT TGAACGCTTA
AATTACAATG TGGTTTACGC ACAGATGGGC AGGGGCAAAA CGGAAAGAAT CCTGGCCCAG
GCCATGTCGG CCCTTGCCGC ATCGCTGGCA AAAATGGCCA TGGATGTTTG TCTGTTCATC
AATCAGAATT TTGGCTTTAT CAGTTTCCCT GATGAACTGA CTACCGGATC GAGTATTATG
CCGCATAAAA AGAACCCGGA TGTGTTTGAG CTGATCCGCT CACGTTGTAA CAAGATCCAG
GCCTTGCCTA ACGAAATTGC AATGATGATC ACTAACCTGC CTTCCGGCTA TCACCGCGAT
CTGCAATTGC TAAAGGAAAA TCTTTTCCCG GCTATGGTTT CTTTAAACGA ATGCCTGCAG
ATGACTACTT ATATGTTGCA AAACATTAGG GTTAAAGATG GGATACTGGA TGATAAAAAG
TATGCTTACC TGTTTAGCGT TGAGGTGGTG AATGAACTGG CTTTAAAAGG TGTGCCTTTT
AGGGAGGCAT ATAAAATTGT TGGTGAGAGC ATTGAAAATG GCTCGTTTAA GCCTGAAACA
CAGATTAACC ATACCCATGA AGGCAGCATT GGCAATTTAT GCAATGCAGA GATCACCGCG
ATGATGGATG AGGTATTGTC GCAGTTTAAG TTTGAACAAA CACATCAGGC GATAGAGAAA
TTGCTGGCTT GA
 
Protein sequence
MKIWQKNIDV NKDIEAFTVG KDRELDLQMA AYDVLGSLAH VEMLESIGLL TAAELAEIQK 
ALKNIYAEIE AGKFVIEDTV EDVHSQVEWL LTQRIGDAGK KIHSGRSRND QVLVDLKLYF
RSCIEEMVGN TAVLFAQLIE LSNTHAAKLL PGYTHLQIAM PSSFGLWFGA YAESLVDDME
LMLAAWKVCN KNPLGSAAGY GSSFPLNRTM TTELLGFERL NYNVVYAQMG RGKTERILAQ
AMSALAASLA KMAMDVCLFI NQNFGFISFP DELTTGSSIM PHKKNPDVFE LIRSRCNKIQ
ALPNEIAMMI TNLPSGYHRD LQLLKENLFP AMVSLNECLQ MTTYMLQNIR VKDGILDDKK
YAYLFSVEVV NELALKGVPF REAYKIVGES IENGSFKPET QINHTHEGSI GNLCNAEITA
MMDEVLSQFK FEQTHQAIEK LLA