Gene Jann_3353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3353 
Symbol 
ID3935826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3405002 
End bp3406066 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content65% 
IMG OID637905726 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_511295 
Protein GI89055844 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.708465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATC CGTTACCGGC TGGCAGCACC GTCGGCATTC TGGGTGGCGG TCAGTTGGGC 
CGCATGTTGG CTATGGCCGC TGCGAACCTC GGCTACCGGG CCCACATCTT TGAGCCCGGC
CCCGCCCCCG CCGCCGATGT GGCCCATGCC TGGACGCAAG CCGGCTACGA TGACCTCGAC
GCCCTGCGCA GCTTCGCGCA GGCCTGCGAT GTCATCACCT TCGAGTTTGA GAATATCCCC
GCCGACGCCC TCCACGTCAT CGCCAGCACC ACACCCCTGT TCCCGGACCG CCGCGCGTTG
GAAACCAGCC AGGACCGCCT GATTGAGAAG GCCTTTCTCC GCGATATCGG CCTGAAAACA
GCGCCCTATG CGCCGGTCAG CGGTGACATT CACGATGTGC TGACCACGAC GGGCACCCCC
GCGATCCTGA AGACCCGCAG GTTCGGCTAT GATGGCAAGG GACAGGCGCG CGTCATGGAC
ATGGGCGAGG CCGGGGCCGC CTTGGCCGCA CTGGAAGGCG CGCCCGCGAT TGCAGAGGGG
TTCGTCGATT TCTCCACAGA GATCAGCGTC ATCGCGGCGC GCGGTCAAGA TGGGTCCGTC
GCGGCGTTTG ACCCCGGCGA GAACGTCCAT AAGGATGGCA TCCTCGATAC GACCACAGTG
CCCGCCGCGA TTCCGGCGTC CCTGCGCACC GACGCGGTGC TGATCGCATC GCAGATCCTC
ACTGCCCTCG ACTATGTGGG CGTTCTGGGG GTGGAGTTGT TCGTCACGCC CGCAGGTCTG
ATCGTCAATG AAATCGCCCC GCGCGTGCAC AATTCCGGCC ATTGGACCCA AGCGGGCTGC
GCCGTGGACC AGTTTGAGCA ACACATGCGG GCCGTCACCG GCTGGCCGCT CGGGGACGGC
AGCCGTCATG CCAACGTTGT GATGGAAAAC CTCATCGGCG AGGACATCGC GCGCGCGTCA
AACCTGGCCA GTGAACCCGG CGTGCAGATC CACCTTTACG GCAAGGCTGA GACGCGGCAG
GGGCGCAAGA TGGGCCATAT CAACCGCGTG ACAGGTCCGG CGTAA
 
Protein sequence
MSDPLPAGST VGILGGGQLG RMLAMAAANL GYRAHIFEPG PAPAADVAHA WTQAGYDDLD 
ALRSFAQACD VITFEFENIP ADALHVIAST TPLFPDRRAL ETSQDRLIEK AFLRDIGLKT
APYAPVSGDI HDVLTTTGTP AILKTRRFGY DGKGQARVMD MGEAGAALAA LEGAPAIAEG
FVDFSTEISV IAARGQDGSV AAFDPGENVH KDGILDTTTV PAAIPASLRT DAVLIASQIL
TALDYVGVLG VELFVTPAGL IVNEIAPRVH NSGHWTQAGC AVDQFEQHMR AVTGWPLGDG
SRHANVVMEN LIGEDIARAS NLASEPGVQI HLYGKAETRQ GRKMGHINRV TGPA