Gene Franean1_1754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1754 
SymbolpyrG 
ID5670156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2104159 
End bp2106117 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content72% 
IMG OID641240675 
ProductCTP synthetase 
Protein accessionYP_001506098 
Protein GI158313590 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0206256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00103098 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCGCAGG CGCATGTGAC CAAGCACGTG TTCGTGACGG GAGGTGTGGC GTCCAGTCTC 
GGTAAGGGAC TGACGGCGTC CAGCCTCGGC CGTCTGCTCA AGGCCCGCGG TCTGCGGGTC
ACGATGCAGA AGCTCGACCC GTATCTGAAT GTTGATCCGG GAACGATGAA CCCGTTCCAG
CACGGCGAGG TGTTCGTCAC CAACGACGGC GCCGAGACCG ACCTGGACAT CGGGCACTAC
GAACGCTTCC TCGACGTCGA CCTCGACGGC AGCGCGAACG TGACCACCGG GCAGGTGTAC
TCGGCGGTCA TCGCCCGCGA GCGGCGCGGC GGGTACCTCG GTGAGACGGT GCAGGTCGTC
CCGCACATCA CCGACGAGAT CAAGGGCCGG ATCCGCCGGC TGGCCACCGA TTCGGTGGAC
GTGGTGATCA CCGAGGTGGG TGGCACCGTC GGCGACATCG AGTCGCTGCC CTACCTGGAG
GCGATCCGCC AGGTCCGGCA CGAGGTCGGC CGGGACAACG CGCTGACGGT GCACGTCAGC
CTCGTCCCGT ACCTCGCGCC GTCGGGTGAG CTGAAGACCA AGCCGACCCA GCACTCGGTC
GCGGCGCTGC GCAGCATCGG CCTGCAGCCG GACGCGGTGG TCTGCCGCTC CGACCGGCCG
TTGCCCGACG CGCTCAAGCG CAAGATCGCC ATGATGTGCG ACGTCGACGA CGAGGCGGTC
GTCGGCGCAC CGGACGCCGG CTCGATCTAT GACATCCCGA AGGTGCTGCA CAAGGAGGGC
CTCGACGCCT ACGTCGTGCG CCGGCTCGGC CTCAGCTTCC GGGACGTCGA CTGGACGGAG
TGGGACACCC TGCTCCGCCG TGTGCACCAT CCCTCGAACA CCGCCACGGT GGCGATCGTC
GGCAAGTACG TCGACCTGCC CGACGCCTAC CTGTCGGTGA CCGAGGCGCT GCGCGCCGGC
GCGTTCGCCG TCGACGCCCG CGCGGACATC CGCTGGATCG GGTCGGACGA GTTCACCTCG
CCCGCCGCCG CCGCGCACGC GCTGGAAGGC GTGGACGGGA TCATCGTCCC GGGCGGGTTC
GGGATCCGCG GCATCGAGGG CAAGATCGAG GCGCTGCGGC ACGCCCGCGA ACACGGCATT
CCCACGCTGG GGATCTGCCT GGGCCTGCAG TGCATGGTGA TCGAGGCCGG CCGTTCGCTG
GCCGGGCTGG CCGGGTCGAA CTCCACCGAG TTCGACCCGG AGACCGCCCA CCCGGTGATC
TCCACGATGG CCGACCAGCA CGACGTGGTC GCCGGCAAAC GGGATATGGG CGGCACGATG
CGGCTCGGCC TGTACCCGTG CGCGCTGGGC GCGGGGACCG TCGCCCGCCG GGAGTACGGC
GAGCCGGAGG TGCTCGAGCG CCACCGGCAC CGCTACGAGG TCAACAACTC CTACCGGGAG
CGGCTGACCG CCGCCGGGCT GGTGTTCTCC GGCACCTCGC CGGACGGCCG CCTGGTCGAG
GTCGTCGAGC TGCCGGCCGA CGTCCACCCG TTCTACGTGG GGACGCAGGC GCACCCGGAG
TTCCGGTCGC GGCCGACGCG CGCGCACCCG CTGTTCCGCG GGCTCGCCGC CGCTGCCGTG
GCCCACGCCG ACCGGCGCCG GGGCCGGCTG CCCGTCGACA TCCCGGACGG TGGTGTCGCG
GGCGGTGGTG TCGCGGGCGG CGGGCCCGCC GATGGTGACA TGATCGGGTC GGGTGACGGT
GACGCGGCCG GTGCGCGGGC GGGCAACGGT GCCCGGTCGG CAGGTGGGCG GGCCCGCCGG
GGCCGCGCGG AGCAGAGCGC GGCGGCCGGC GCCGGCACGA CCGCCGGTGT CAGCGGTGCC
GCGGGTACCG GTGGTGCCGT GGGTGCGGCA TTGCCCGGTG GGTCCACGGG GCCGGCCGCG
GCGGCCGGTG TGAGCGACGG AGAGCTGGTT TCCCCATGA
 
Protein sequence
MAQAHVTKHV FVTGGVASSL GKGLTASSLG RLLKARGLRV TMQKLDPYLN VDPGTMNPFQ 
HGEVFVTNDG AETDLDIGHY ERFLDVDLDG SANVTTGQVY SAVIARERRG GYLGETVQVV
PHITDEIKGR IRRLATDSVD VVITEVGGTV GDIESLPYLE AIRQVRHEVG RDNALTVHVS
LVPYLAPSGE LKTKPTQHSV AALRSIGLQP DAVVCRSDRP LPDALKRKIA MMCDVDDEAV
VGAPDAGSIY DIPKVLHKEG LDAYVVRRLG LSFRDVDWTE WDTLLRRVHH PSNTATVAIV
GKYVDLPDAY LSVTEALRAG AFAVDARADI RWIGSDEFTS PAAAAHALEG VDGIIVPGGF
GIRGIEGKIE ALRHAREHGI PTLGICLGLQ CMVIEAGRSL AGLAGSNSTE FDPETAHPVI
STMADQHDVV AGKRDMGGTM RLGLYPCALG AGTVARREYG EPEVLERHRH RYEVNNSYRE
RLTAAGLVFS GTSPDGRLVE VVELPADVHP FYVGTQAHPE FRSRPTRAHP LFRGLAAAAV
AHADRRRGRL PVDIPDGGVA GGGVAGGGPA DGDMIGSGDG DAAGARAGNG ARSAGGRARR
GRAEQSAAAG AGTTAGVSGA AGTGGAVGAA LPGGSTGPAA AAGVSDGELV SP