Gene Dret_2411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2411 
Symbol 
ID8420271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2760423 
End bp2763410 
Gene Length2988 bp 
Protein Length995 aa 
Translation table11 
GC content62% 
IMG OID645039012 
ProductPhosphoribosylformylglycinamidine synthase 
Protein accessionYP_003199271 
Protein GI258406529 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00147677 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCGCCC GATTGAGTGT GGGCATGCGC GAGCATGTCC AGGATGTGCA GGGAGCGAAG 
ATCGCTCGTG AGATTCAGGA AACCTGCGGG ATCACGGTCG AAGCCGTGTC CCTGGTCCAG
ATGTATACCC TCCAGGGACT GAGCGCTGAG GAGTGCCGGC AGATCCAGGA CCAGGAAGTG
CTCCACGACC CGGTATTGCA CGAGATCCGG GAGACGCCGT GGCCGGCCCA GGGGGACTGG
ATCGTCGAAG TCGGCTTCAA ACCGGGCGTG ACCGACAATC CCGGACGGGT CGCCCTGCAG
ACCGTGCGCA CGGTGCTCGA CCTGCCGGCC GAACGCGCCG CCCGGGTCTA CACATCGCAG
GAGTATCATA TCCGCGGGGC CCTTACCCGT GAGGATGTGG AGCGCATTGG CCGTGATCTT
CTGGCCAATG AACTCATCCA TCGTCTGCGG ATCAAGGACC GCACCGCCTG GGAGGCCGAT
CCCGGCTTTC CCGCCGACCC CCCGGAGGTC ACCGGACAGG CGACCGATAT CGTGGAGACA
GTTTCCTTGT CCGGCCTGGA CGAGCCCGCC CTGCAGCAGC TCAGTCAGGA TCGGGTCCTG
GCCCTGAGCC CCAAGGAGAT GCGGGCCATC AAGGCGTACT ATCGCCGCGA AGATGTCCGG
GCTTCCCGGG CCGGGTTGGG GTTGCCGACA GAGCCCACGG ATGCCGAACT GGAATGCCTG
GCCCAGACCT GGTCTGAGCA CTGCAAACAT AAGATTTTCA ACGCCCGCAT CGACTACACC
AACCGGGAGA CCGGGCAGAA GCGGATTGAG GACTCTCTGT TCGGGACCTA TATCCAGGGC
AGTACGAAAA CCCTCCGGCG TCGCCTCGGT GCGGCCGATT TCTGTCTCTC GGTCTTCAAG
GACAATGCCG GGGTGATCCG TTTTAATGAC GATTACAGCG TGTGCATCAA GGTCGAGACG
CACAATTCCC CTTCGGCTTT GGATCCTTAC GGCGGGGCTC TGACCGGGAT CGTCGGCGTC
AATCGTGACC CGATGGGCAC GGGCATGGGC GCCAATCTGA CCTGCAACAC CGATGTCTTT
TGTTTTGCCT CCCCCTTTTA CGATCAGCCG TTGCCGCCAC GCCTGCTCCA TCCGCGGCGG
GTTCTGGAAG GGGTCCGCGA AGGTGTGGAG CACGGGGGGA ACCAGTCCGG GATTCCGACG
GTCAACGGCG GCCTCGTTTT TGACGAACGC TACCTCGGCA AGCCCCTGGT CTTCTGTGGC
ACGGTCGGGA CCATGCCGGC CAAGGTCAAG GGACGGCCGA GCCATGAGAA AAAAGCCCTT
CCCGGTGACG CGGTGGTCAT GGTCGGCGGC CGGATTGGCA AGGATGGGAT CCACGGGGCC
ACCTTTTCTT CGGAAGAACT CCACGCCGGT TCCCCGGCAA CGGCGGTCCA GATAGGAGAT
CCCATCACCC AGCGCAAGCT CTATGATTTT CTGATGCGCG CCCGGGATCT GGGGATGTAC
AACGCCCTGA CTGACAATGG GGCCGGCGGG CTCTCGTCCT CGGTGGGCGA GATGGCCGAA
GACAGCGGCG GCTGTGAACT CGACCTGGCC CTGGCGCCCT TGAAATACGA CGGGCTGGCT
CCCTGGGAAA TCCTGCTCTC CGAGGCCCAG GAGCGGATGA CGGTGGCTGT GCCCCCGGAC
CAGCTCGAGG CCTTTTTGGC CTTGTCCCGG GAGATGGATG TCGAATCCAC TGCATTGGGG
CACTTCACCG ACAGCGGCTA TTTCCACGTC CGTTACAGCC AGGCCACGGT GGCTTTTCTG
GAGATGGAAT TTCTGCACAA CGGTTTGCCG CAGATGGAGT TGGAGGCGGT CTGGGAACGC
CCGGCACTCG CGGAGACCTC CGTCACCTTG CCCGGCGAGG AACAGGCCGA AACACTGCTC
GTGGACCTGC TCGGACGATT GAATATTTGT AGTCGGGAAT ACGTCATCCG GCAGTATGAC
CATGAGGTTC AGGGCGGCAG CGTGGTCAAA CCCCTTGTCG GCCAGTTCGC GGACGGCCCC
GGTGACGCCG CGGTTATTCG CCCGGTGCTC GAGAGCCGTA AAGGGCTGGT GGTCTCCAAT
GGAATCGCCC CCAAGTACAG CGATTTGGAC ACCTATTGGA TGGCCGCCAA TTGTATCGAC
GAGGCCATTC GCAATGCCGT GGCCGTGGGG GGACGTCTCG ATCATATGGC GGGGATCGAC
AATTTTTGCT GGTGCGATCC GGTCCAATCG GAAAAGACCC CGGACGGGCG GTACAAATTG
GCCCAGCTGG TCCGGGCGAA TCAAGCCCTG GCCCAGTACT GCCTTGCCTT CGGTGTGCCG
TGCATTTCAG GCAAAGATTC CATGAAAAAC GATTACTACG GCGGGGGGCG GAAAATCTCC
ATCCCGCCGA CGCTTCTTTT TTCTGTCATC TCGGTCATGC CTGATGTCCA ACACAGCCTG
ACGTCGGATC TCAAGCAGGC TGGAGACGGC CTGTATCTGT TGGGACGAAC CCGAGCTGAG
TTCGGCGGCA GTGAAATCGC TCAAATGCTT GGGCAATCAG GCGGGGCGGT GCCGGAAGTT
GATGCGTTGG CGGCCAGGAA GCGGTACCAG ACCTTCAACG AGGCCACGAG CGCGGGGCTG
GTCAATGCCT GCCACGACCT TTCCGACGGT GGCCTTGGTG TGGCCGCAGC GGAAATGGCC
ATTGGGGGGC GCCTGGGAGT GCGTATTGAT GTCGCTACCG CGCCTTGTGC CCCGGCGGAT
CTGACCCCGC TGGAACGGTT GTTCAGTGAG TCGGCGAGTC GATTGCTGGT CAGTGTGCCA
GCCGAACGGC AGGAGACCTT TGAGGCCCTG TTTGCCGGCC AGGCCTGCGC TCGTATCGGT
GAGGTGACAG CCGCACCGGA GATCGTCTTT ACCTCAAGCG AGACGGCGTT GTGCCGGCTC
AACGTTGAGC AGTGCGCTGA GGCCTGGAAG GCGACGCTGA ACTGGTAG
 
Protein sequence
MIARLSVGMR EHVQDVQGAK IAREIQETCG ITVEAVSLVQ MYTLQGLSAE ECRQIQDQEV 
LHDPVLHEIR ETPWPAQGDW IVEVGFKPGV TDNPGRVALQ TVRTVLDLPA ERAARVYTSQ
EYHIRGALTR EDVERIGRDL LANELIHRLR IKDRTAWEAD PGFPADPPEV TGQATDIVET
VSLSGLDEPA LQQLSQDRVL ALSPKEMRAI KAYYRREDVR ASRAGLGLPT EPTDAELECL
AQTWSEHCKH KIFNARIDYT NRETGQKRIE DSLFGTYIQG STKTLRRRLG AADFCLSVFK
DNAGVIRFND DYSVCIKVET HNSPSALDPY GGALTGIVGV NRDPMGTGMG ANLTCNTDVF
CFASPFYDQP LPPRLLHPRR VLEGVREGVE HGGNQSGIPT VNGGLVFDER YLGKPLVFCG
TVGTMPAKVK GRPSHEKKAL PGDAVVMVGG RIGKDGIHGA TFSSEELHAG SPATAVQIGD
PITQRKLYDF LMRARDLGMY NALTDNGAGG LSSSVGEMAE DSGGCELDLA LAPLKYDGLA
PWEILLSEAQ ERMTVAVPPD QLEAFLALSR EMDVESTALG HFTDSGYFHV RYSQATVAFL
EMEFLHNGLP QMELEAVWER PALAETSVTL PGEEQAETLL VDLLGRLNIC SREYVIRQYD
HEVQGGSVVK PLVGQFADGP GDAAVIRPVL ESRKGLVVSN GIAPKYSDLD TYWMAANCID
EAIRNAVAVG GRLDHMAGID NFCWCDPVQS EKTPDGRYKL AQLVRANQAL AQYCLAFGVP
CISGKDSMKN DYYGGGRKIS IPPTLLFSVI SVMPDVQHSL TSDLKQAGDG LYLLGRTRAE
FGGSEIAQML GQSGGAVPEV DALAARKRYQ TFNEATSAGL VNACHDLSDG GLGVAAAEMA
IGGRLGVRID VATAPCAPAD LTPLERLFSE SASRLLVSVP AERQETFEAL FAGQACARIG
EVTAAPEIVF TSSETALCRL NVEQCAEAWK ATLNW