Gene Pnuc_1550 details


Gene Information

Locus tag: Pnuc_1550
Symbol:
ID: 5052508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1
Kingdom: Bacteria
Replicon accession: NC_009379
Strand:
Start bp: 1619764
End bp: 1621104
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 47%
IMG OID: 640471723
Product: SAF domain-containing protein
Protein accession: YP_001156328
Protein GI: 145589731
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4091] Predicted homoserine dehydrogenase
TIGRFAM ID:
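
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1619764
END_BP = 1621104
GENE_LENGTH_BP = 1341
PROTEIN_LENGTH_AA = 446


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length_from_cds(n_bp: int) -> int:
    """Residues encoded by a CDS of n_bp bases that ends with a stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and contributes no residue.
    return n_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks hold for this record: 1621104 - 1619764 + 1 = 1341 bp, and 1341 / 3 - 1 = 446 aa.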


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000710074
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCTTA TTCAGAAGTT GAAGGCGCGT GCTGCCAATA ACAATCCTGT GCGTGTAGGT 
GTGATTGGAG CAGGCAAGTT TGGTTCGATG TATCTTTCCC AGGCACCTCG TACTCCCGGC
ATACATTTAG TAGCCGTTGC CGACTTATCT CCAGCGCGTG CAAAAGAATC ATTGGCTCGT
GTAGGTTGGG ATACACCTCG TTATAGCGCT ACATCAATGC AAGATGCTGC TAAATCAGGC
GCTACTTTTG TGACTGATGA TGCAGAAAAG ATGATTGCTA GTGAGTACAT CGACATCGTG
ATTGATGCTA CCGGGAGTCC AGCTGCTGGT ATTCGTCATG CATTGCTTTG TTTTGATCAT
CGCAAACATA TCATTATGGT CAACGTAGAG GCTGACGTGT TGGCTGGCCC ATTGTTAGCG
CGTAAGGCAG CAGAGGCTGG GGTCATTTAC TCCATGGCTT CTGGTGATCA ACCAGCTCTC
ATTGCTGAAC TAGTTGATTG GGCTAGAACG ATTGGCCTTG AAGTAGTGTG CGCTGGTAAG
GGCACTAAGT ATTTGCCTAT CTATCACCAG TCCACTCCAG ATACTGTCTG GGGGCACTAC
GGATTTTCTG AAGAGCAGGT GGCTGGTGGC GACTTTAATG CACAAATGTT CAACTCATTC
TTGGATGGCA CTAAATCAGC GTTAGAAATG GCGGCAGTAT CGAATGGTTG CGATTTAACG
CCTCCAAGCA ATGGCTTGGA ATTTCCACCT TGCGGGGTTG ATGATTTACC GCACATCTTT
CGCCCCATAT CTGAGGGTGG AATTCTGAAG CAAAAAGGAA CTGTAGAGGT AGTCTCTTCA
GTTGAAAGAG ACGGTCGCCC AGTATTTAGA GATTTACGTT GGGGTGTATT TGCAGTGTTT
GAGGCACCGA GCCAATATGT TATTGATTGC TTCTCGCAAT ATGGCTTAAA GACCGATAGC
ACTGGTAAAT ATGCAGCAAT GTATAAGCCT TATCACCTTA TAGGCTTAGA ACTTGGTATC
TCAGTTGCGA GTATTGCCGT ACGTGGCGAA GCTACAGGCG CTACAGGTGA TTGGAGGGGT
GATGTGGTTG CCACCACTAA GCGTGCACTG AAAGCGGGCG AAAAATTAGA TGGAGAGGGC
GGTTTTACCG TTTACGGCAA ACTCATGACA GCTGCTGATT CCTTAAAACT CGGCGCTCTA
CCAATTGGTC TGGCACACAA CATGGCTCTG AAAAGAGATA TTCCTGCGGG AAAACCAGTT
TGCTGGAGTG ATGTTGACTA CGATGCCACT AAGCAAGCAA TAGCGTTCCG TAGGGAAATG
GAAACAGTAT TTGGTAAATA G
 
Protein sequence
MSLIQKLKAR AANNNPVRVG VIGAGKFGSM YLSQAPRTPG IHLVAVADLS PARAKESLAR 
VGWDTPRYSA TSMQDAAKSG ATFVTDDAEK MIASEYIDIV IDATGSPAAG IRHALLCFDH
RKHIIMVNVE ADVLAGPLLA RKAAEAGVIY SMASGDQPAL IAELVDWART IGLEVVCAGK
GTKYLPIYHQ STPDTVWGHY GFSEEQVAGG DFNAQMFNSF LDGTKSALEM AAVSNGCDLT
PPSNGLEFPP CGVDDLPHIF RPISEGGILK QKGTVEVVSS VERDGRPVFR DLRWGVFAVF
EAPSQYVIDC FSQYGLKTDS TGKYAAMYKP YHLIGLELGI SVASIAVRGE ATGATGDWRG
DVVATTKRAL KAGEKLDGEG GFTVYGKLMT AADSLKLGAL PIGLAHNMAL KRDIPAGKPV
CWSDVDYDAT KQAIAFRREM ETVFGK
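
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is illustrative only: it builds the codon table from the usual TCAG ordering (translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons), strips the spacing used in the listing, and also computes a GC fraction. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Alternative start codons are not special-cased; this gene starts with ATG.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 30 bases and last 21 bases of the gene sequence in this record.
    print(translate("ATGTCCCTTA TTCAGAAGTT GAAGGCGCGT"))  # MSLIQKLKAR
    print(translate("GAAACAGTAT TTGGTAAATA G"))           # ETVFGK
```

The two fragments translate to the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.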