Gene Jann_1836 details

Gene Information

Locus tag: Jann_1836
Symbol:
ID: 3934287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1822102
End bp: 1823151
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 64%
IMG OID: 637904190
Product: phosphoribosylaminoimidazole synthetase
Protein accession: YP_509778
Protein GI: 89054327
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
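
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Sketch: cross-check the record's coordinates against its stated lengths.
# Assumption: Start bp / End bp are 1-based and inclusive.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1822102, 1823151)
print(length)                  # 1050
print(protein_length(length))  # 349
```

Both values agree with the Gene Length and Protein Length fields above.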


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.123616
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACA GCAAACCGGA CGCGTTGACC TATGCGGATG CGGGCGTCGA TATCGACGCT 
GGCAACGCGC TTGTGGATCG GATCAAGCCC GCCGCTGCCG CGACAAATCG TGCCGGTGTC
ATGGCGGGCC TGGGGGGCTT TGGCGCTTTG TTCGATCTCA AGGCCGCGGG CTATGATGAT
CCGGTTCTTG TGGCCGCAAC GGACGGTGTC GGCACCAAGC TAAAGATCGC CATCGACACG
GGCAACTTCG ACACTATCGG CGTGGATCTG GTGGCCATGT GTGTCAACGA TCTGGTCTGT
CAGGGTGCGG AGCCGCTGTT TTTCCTGGAT TATTTCGCGA CGGGCAAGCT GGATGTCGAT
GACGCCGCGC GTATTGTCGA AGGCATTGCG GCGGGTTGCA AAGCCTCGGG TTGCGCGTTG
ATTGGCGGCG AGACGGCAGA GATGCCGGGC ATGTATGCGC CCGGCGATTT TGACCTTGCG
GGCTTTTCCG TTGGCGCGAT GGAGCGGGGC CGGGCGCTGC CGGACGGCGT GGCAGAGGGC
GATGTGCTGC TGGGGTTGGC GTCAGACGGC GTGCATTCCA ACGGCTATTC GCTGGTGCGC
CGTGTGGTGG AGCGATCCGG GCTGGCCTGG GATGCGCCCG CACCCTTCGC GCAATCCAGT
TTGGGAGAGG CACTTTTGGC CCCCACGCGC CTTTACGTGC AGCCCGCGCT GGCCGCGATC
CGTGCGGGCG GGGTCCATGC CCTGGCCCAT ATCACCGGTG GCGGATTGAC CGAGAACATC
CCCCGCGTTC TGCCCGATGG GCTCGGCGTT GATATCGACC TGTCATCCTG GTCCCTGCCG
CCGGTATTCG GCTGGCTGGC GCAGGAAGGT GCGTTGGATC AGGCGGAGCT TCTGAAGACG
TTCAACGCAG GCCTCGGCAT GGTCCTCGTG GTCTCGGCGG ATGCCGTCGA TGGCCTGACC
TGGACATTGG AAGACGCGGG CGAAAGTGTG CACCGCATCG GCACCGTCAC GGCAGGCGCG
GGCGTGCGCT ACTCGGGATC GCTTGGATGA
 
Protein sequence
MTDSKPDALT YADAGVDIDA GNALVDRIKP AAAATNRAGV MAGLGGFGAL FDLKAAGYDD 
PVLVAATDGV GTKLKIAIDT GNFDTIGVDL VAMCVNDLVC QGAEPLFFLD YFATGKLDVD
DAARIVEGIA AGCKASGCAL IGGETAEMPG MYAPGDFDLA GFSVGAMERG RALPDGVAEG
DVLLGLASDG VHSNGYSLVR RVVERSGLAW DAPAPFAQSS LGEALLAPTR LYVQPALAAI
RAGGVHALAH ITGGGLTENI PRVLPDGLGV DIDLSSWSLP PVFGWLAQEG ALDQAELLKT
FNAGLGMVLV VSADAVDGLT WTLEDAGESV HRIGTVTAGA GVRYSGSLG
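
The relationship between the two sequences above can be illustrated with a short translation sketch. This is an assumption-laden example rather than the method used to produce the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons). Only the first 60 bases of the gene sequence are used to keep the example short.

```python
# Sketch: translate the start of the gene sequence and compute GC fraction.
# Codon table built in NCBI "TCAG" order; amino acid assignments are those
# of the standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above (60 bases).
first_line = "ATGACTGACA GCAAACCGGA CGCGTTGACC TATGCGGATG CGGGCGTCGA TATCGACGCT"
print(translate(first_line))  # MTDSKPDALTYADAGVDIDA
print(gc_fraction(first_line))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.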