Gene Pnap_1093 details


Gene Information

Locus tag: Pnap_1093
Symbol:
ID: 4687346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 1159718
End bp: 1160932
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 66%
IMG OID: 639834093
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_981331
Protein GI: 121604002
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01325] O-succinylhomoserine sulfhydrylase
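
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; coordinates are assumed to be 1-based and inclusive.
start_bp = 1159718
end_bp = 1160932

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codon_count = gene_length // 3        # number of codons, including the stop codon
protein_length = codon_count - 1      # the stop codon is not translated

print(gene_length, protein_length)    # prints: 1215 404
```

Both results agree with the reported Gene Length (1215 bp) and Protein Length (404 aa).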


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.560821
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGGC AAGCATTGCC CGGGAACCTG CACCCCGACA CGCTGGCCGT GCGTGTCGGC 
ATCGAGCGCA GCCAGTACGG CGAGAACTCC GAAGCTCTGT ATTTGACCAG CGGCTTCGTG
CAGCCCGACG CCGAAACCTC GGCGCGCCGT TTTGCCGGCA CCGAAGAAGG CTTTACCTAC
GCCCGCACGT CGAACCCGAC GGTGGCCGCT TTTGAGCAGC GGCTGGCGGC GCTGGAAGGC
ACTGAAGCCG CCATCGCCGC GTCCAGCGGC ATGGGCGCCA TCCTGATGAT GGGCATGGGC
CTGCTCAGGG CCGGCGACCA TGTCGTGTGT TCGCAGTCGG TGTTCGGCTC GACGCTGAAC
CTGTTCGGCA AGGAGTTCGC CAAGTTCGGC GTCGAGACCA GCTTTGTCTC GCAGACCGAC
ATCGCGCAGT GGCAGGCCGC CATGCGTCCT AATACGAAGC TGCTGTTTGC CGAAACTCCG
ACCAATCCGC TGACAGAGGT GTGCGACATC CGGGCGCTGG CCGATGTGGC GCATGCGGGC
GGCGCGCTGC TGGCGGTGGA CAACTGTTTT TGCTCGCCCG CGCTGCAGCG GCCGACTGAA
CTGGGCGCCG ACCTGGTGAT CCACTCGGGC ACCAAGTACC TCGACGGACA GGGCCGGGTC
ATGGCGGGGG CGATTTGCGG GCCTTCGAAG CTGATCGTCG ATGTGTTCGG CCCGATTGTG
CGCACCGCCG GCATGGTGCT GGCGCCGTTC AATGCCTGGG TCGTGCTCAA GGGCATGGAA
ACGCTGCGCA TCCGCATGCA GGCGCAAAGC GCCACGGCGC TGGCGATTGC GCAATGGCTG
GAAACGCATC CGGCCGTGAC CCGCGTGTAT TACCCCGGCT TGCCGTCGCA CCCGCAGCAC
GAACTGGCGA TGCGCCAGCA GTCGGGTTTG GGCGGCGCCG TGGTGTCCTT CGACGTGCGC
GGCGGCGACC CGGAGACGGC GCGCGCCAAT GCTTTCCATG TGATCAACAG CACGCAGGTG
GTCAGCATTG CCACCAACCT GGGCGACACC AAGTCCATCA TCACCCATCC CGGAACCACT
TCCCATGGCC GGCTCACCGA AGCGCAGCGC CAGGCAGCCG GCATCAAGCA GGGCCTGATC
CGCTTTGCCA CCGGCCTGGA ACATATCGAC GATTTGAAAG CCGACCTCGC ACGCGGTCTG
GACAGCCTGA CATGA
 
Protein sequence
MTRQALPGNL HPDTLAVRVG IERSQYGENS EALYLTSGFV QPDAETSARR FAGTEEGFTY 
ARTSNPTVAA FEQRLAALEG TEAAIAASSG MGAILMMGMG LLRAGDHVVC SQSVFGSTLN
LFGKEFAKFG VETSFVSQTD IAQWQAAMRP NTKLLFAETP TNPLTEVCDI RALADVAHAG
GALLAVDNCF CSPALQRPTE LGADLVIHSG TKYLDGQGRV MAGAICGPSK LIVDVFGPIV
RTAGMVLAPF NAWVVLKGME TLRIRMQAQS ATALAIAQWL ETHPAVTRVY YPGLPSHPQH
ELAMRQQSGL GGAVVSFDVR GGDPETARAN AFHVINSTQV VSIATNLGDT KSIITHPGTT
SHGRLTEAQR QAAGIKQGLI RFATGLEHID DLKADLARGL DSLT
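
The relationship between the gene sequence and the protein sequence, and the GC content figure, can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: `translate` and `gc_fraction` are hypothetical helper names, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs only in which codons may act as starts).

```python
from itertools import product

# Codon table in NCBI order (T, C, A, G); amino acid assignments are
# identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as displayed in the record.
first_bases = "ATGACCCGGC AAGCATTGCC CGGGAACCTG"
print(translate(first_bases))  # prints: MTRQALPGNL
```

The ten residues printed match the start of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` gives the value to compare with the reported GC content.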