Gene Syncc9605_1448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_1448 
Symbol 
ID3736040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp1358436 
End bp1359596 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content63% 
IMG OID637776041 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_381757 
Protein GI78212978 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0729808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGCG TTGTGGGAGG TGGTCAGCTG GCACGGATGC TTGTGCAGGC CGCCGCGCAA 
CGCGAGGTTC CGATCGCCGT TCAGACAGCC AATCCAGCGG ATCCCGCCGC TGGATTGGCT
TCGCGTCTCG TGTCGGCGGA TCCGCGGGAT GTGGCTGGAA CCCGGGAGCT GGTGGTGGGC
TGTGACGGCG TCACCTTCGA GAACGAGTGG GTCAACATCG ACGCCCTTCT GCCGTTGGAG
CAGCAGGGCG TTCGTTTCCA GCCATCCCTG GCTGTGCTCT CGCCCCTGGT CGACAAGTTG
TCGCAGCGTC AGCTGCTCGA CGATCTGGCG ATCCCAAGCC CGCCCTGGTG TCCGCTGCGT
TTGATTTCGC CGGCCCAACC TGCGCTTCCT CAGGGCTGGA CCTTTCCCGT CATGGCCAAG
GCCTCCCGCG GGGGATATGA CGGGAAGGGC ACGGTGGTGT TGCGCGATAT CGATGCCCTG
TCGCAGTTGT TGCGAGCCGT CCCTGCCGAG GATTGGTTGC TGGAATCCTG GGTGGACTAT
GAACTTGAGC TGGCTCTCGT GGTCAGCCGC GATCAGCGCG GCCGGATCCG CCATTTCCCT
CTGGCGCAGA CCCATCAGCA TCAGCAGGTT TGTGACTGGG TTCTGGCACC GGCACCGGTG
GACCCTTCCG TAGCGGCCTT GGCCTACAAC GTTGCTGCGT CGCTGATGAC GAAGCTCGGC
TATGTGGGGG TGTTGGCTCT GGAGTTTTTC TATGGACCAG CTGGCCTGCA GGTGAATGAG
ATCGCCCCCC GTACCCACAA TTCAGGCCAT TTCTCGATCG AAGCCTGCAC CAGCAGTCAG
TTCGATCAGC AGCTCTGCAT CGCGGCTGGT CTTCCCGTGC CTGATCCCGA GCTCAAAAGC
CGCGGTGCCT TGATGGTCAA CCTGCTGGGC CTGGACCCCG AACGCCATGA TCCCTTGGAC
CAGCGGCTCC AGGCGCTGGA AGCCATGCCA GGGCTTCACC TGCACTGGTA CGGCAAGTCA
CCGGAAACTC CGGGGCGCAA GTTGGGCCAC GTGACGCTGC TGCTCGAGAC TGAAACGGTG
GCGATGCGTC GCGATGAGGC CGAGTCAGCA CTTGCCGCCA TTCGCAGGAT CTGGCCCCAC
GCGAGCGAGA GTCAGGACTA G
 
Protein sequence
MIGVVGGGQL ARMLVQAAAQ REVPIAVQTA NPADPAAGLA SRLVSADPRD VAGTRELVVG 
CDGVTFENEW VNIDALLPLE QQGVRFQPSL AVLSPLVDKL SQRQLLDDLA IPSPPWCPLR
LISPAQPALP QGWTFPVMAK ASRGGYDGKG TVVLRDIDAL SQLLRAVPAE DWLLESWVDY
ELELALVVSR DQRGRIRHFP LAQTHQHQQV CDWVLAPAPV DPSVAALAYN VAASLMTKLG
YVGVLALEFF YGPAGLQVNE IAPRTHNSGH FSIEACTSSQ FDQQLCIAAG LPVPDPELKS
RGALMVNLLG LDPERHDPLD QRLQALEAMP GLHLHWYGKS PETPGRKLGH VTLLLETETV
AMRRDEAESA LAAIRRIWPH ASESQD