Gene Plav_3331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_3331 
Symbol 
ID5455524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3567551 
End bp3568642 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content67% 
IMG OID640878921 
Productphosphoribosylformylglycinamidine cyclo-ligase 
Protein accessionYP_001414592 
Protein GI154253768 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.202512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCCA CGCCCCCCAA GGACCGCCCC AACCGCTATA CCTACGCACA GGCGGGGGTC 
GATATCGACG CCGGCAACGA GCTGGTCCGG ATGATCGGAC CGCTGGCGGC CTCCACGAAG
CGGCCCGGTT CGGATGCGGC GCTGGGTGGA TTCGGCGGGT TGTTCGATCT GGCCGCCTGC
GGCTTCAAGG ATCCGGTTCT GGTCGCCGCC AATGACGGCG TCGGCACCAA GCTCAAGGTC
GCGATCGAGG CTGACCGCCA CGACACTGTC GGCATCGATC TCGTGGCCAT GTCCGTCAAC
GACCTTGTGG TGCAGGGCGC GGAGCCCCTC TTCTTTCTCG ACTATTACGC GACGGGCAAG
CTGCATGTCG ATGTCGCGCG CGATGTGGTC GCCGGCATCG CCGAAGGCTG CCGCCAGGCG
GGCTGCGCGC TGATCGGCGG CGAGACGGCC GAGATGCCCG GCATGTATGC GAAGGGCGAT
TATGACCTTG CGGGCTTTGC TGTCGGCGCC GTCGAGCGCG ACGGTGTCCT GCCGCGCGGC
GATGTCGCCC CCGGCGACGT GCTGCTCGGC CTCGCCTCCT CCGGCTTTCA TTCCAACGGC
TTTTCGCTCG TTCGCCGGAT CGTCGAGGAC AATCGCATTT CCTACTCCGC GCCCTTCCCC
GGCGGCGACG GCGCCAGCAT CGGCGAAGTC CTGCTCGCAC CGACGCGCAT CTATGTGAAG
GCGATGCTGA AGACGATCCG CGAGACCGCT GCGGTGAAGG CGGTGGCGCA TATCACCGGC
GGCGGTTTCG TCGAGAACAT TCCGCGCGTG CTGCCGGAAG GCATCAATGT CGAGATCGAC
GGCGCCTCAT GGACCATGCC GCCTGTCTTC CGCTGGCTGA TGGAACTCGG CGGCATCGAC
GACACGGAGA TGGGCCGCAC ATTCAATTGC GGCATCGGCA TGGTTGTGGT GGTCCGCGAG
GATCAGGCGC TCGAAGTTTC GGACGCGCTG GCGGAGGCTG GTGAAACGGT TTTCCGCATC
GGCCGTCTCA TCGAGACTGT GCCCGGCGCG GCGCGCGTTG CTGTGAAGGG CGCGCTCGGG
TCCGGCAAGT GA
 
Protein sequence
MAPTPPKDRP NRYTYAQAGV DIDAGNELVR MIGPLAASTK RPGSDAALGG FGGLFDLAAC 
GFKDPVLVAA NDGVGTKLKV AIEADRHDTV GIDLVAMSVN DLVVQGAEPL FFLDYYATGK
LHVDVARDVV AGIAEGCRQA GCALIGGETA EMPGMYAKGD YDLAGFAVGA VERDGVLPRG
DVAPGDVLLG LASSGFHSNG FSLVRRIVED NRISYSAPFP GGDGASIGEV LLAPTRIYVK
AMLKTIRETA AVKAVAHITG GGFVENIPRV LPEGINVEID GASWTMPPVF RWLMELGGID
DTEMGRTFNC GIGMVVVVRE DQALEVSDAL AEAGETVFRI GRLIETVPGA ARVAVKGALG
SGK