Gene Pnec_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnec_1042 
Symbol 
ID6183030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolynucleobacter necessarius subsp. necessarius STIR1 
KingdomBacteria 
Replicon accessionNC_010531 
Strand
Start bp904716 
End bp905924 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content46% 
IMG OID641671654 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_001797831 
Protein GI171463718 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.37384 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGTA AAACTACACG CAAAAAACTG GATTTTTCTA AACTTGCACT CGAAACCATA 
GCAGTTCGTG TAGGCACTCG TCGTACGGCT GAATATCAAG AGCACTCCGA GGCAATGTTC
CTCACATCAA GCTTTTGCTT TGATAGCGCC GAATTAGCGG CTGATGGCTT TGCTCATGCT
GATCAAGGTT TTATTTATTC ACGCTTTACC AATCCAACCG TGAGTATGTT CCAAGATCGC
TTGGCTGCTC TTGAGGGTGG CGAAGCTTGT ATTGCTACTG CCTCTGGTAT GTCTGCCATT
CTGACAATGG CAATGGCTCA CCTGCAAGCA GGTGATCACG TTATTTGCTC GCGTTCTGTA
TTTGGTGCAA CGATTCAGTT GTTCACGAAT ATTTTGGGTC GCTTTGGTAT TACGACAACT
TATGTTGATT TGACTGATAC TAAGTCATGG CAGGCTGCTG TCCAACCAAA CACCAAACTC
TTTTATCTAG AGACACCTTC CAATCCTTTG ACTGAGATTG CGGATATCAA AGCAATTTCA
AGGATAGCAA AAAAGGCAGG TGCCTTGTTT GCTGTAGATA ACTGCTTCTG CACTCCGGCA
TTACAAAAAC CATTGGCGCT TGGTGCTGAT GTTGTGATTC ATTCTGCAAC TAAGTATTTA
GATGGTCAGG GCAGGATGGT TGGTGGCGCC ATTGTAGGCA ACAAAGATTT CATTATGGGA
AAAGTGTTCC CTTATGTGCG TACTGCAGGC CCAACACTGT CAGCATTCAA TGCTTGGGTA
TTCTTAAAAG GCTTGGAGAC TCTAGAGCTT CGCATGAAGC AGCAGAGTCA AAATGCGCTT
GCCTTGGCTC AATGGTTGGA GAAGCAACCT GGCGTAGAAC GCGTGTACCA TCCAGGCCTG
AAAACGCACC CTCAACATGC CTTAGCTAAA CGCCAGCAAA AAGAGGGTGG GGCGATTCTA
TCTTTTACCC TCAAGGGTGG AAAGAAGGCG GCATTCAAAC TTATCAATCA AACCAAGCTC
TGCTCGATCA CTGCAAACTT AGGGGATACC CGCACAACAA TTACCCATCC AGCGACAACG
ACACATTGTC GCGTCAGTCC TGAAGCCAGA AAAGCAGCAG GCATATCCGA TGGATTGGTG
CGTATTGCAG TTGGCCTCGA GAATATCAGC GATTTAAAGA ACGACCTCCT TGGTGGACTC
AAAAAATAA
 
Protein sequence
MKSKTTRKKL DFSKLALETI AVRVGTRRTA EYQEHSEAMF LTSSFCFDSA ELAADGFAHA 
DQGFIYSRFT NPTVSMFQDR LAALEGGEAC IATASGMSAI LTMAMAHLQA GDHVICSRSV
FGATIQLFTN ILGRFGITTT YVDLTDTKSW QAAVQPNTKL FYLETPSNPL TEIADIKAIS
RIAKKAGALF AVDNCFCTPA LQKPLALGAD VVIHSATKYL DGQGRMVGGA IVGNKDFIMG
KVFPYVRTAG PTLSAFNAWV FLKGLETLEL RMKQQSQNAL ALAQWLEKQP GVERVYHPGL
KTHPQHALAK RQQKEGGAIL SFTLKGGKKA AFKLINQTKL CSITANLGDT RTTITHPATT
THCRVSPEAR KAAGISDGLV RIAVGLENIS DLKNDLLGGL KK