Gene Francci3_3175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3175 
SymbolargC 
ID3903900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3762643 
End bp3763671 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content74% 
IMG OID637880499 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_482261 
Protein GI86741861 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.461089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.232638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGTGA CGGTAGCAGT TGCGGGGGCG AGTGGGTACG GCGGCGGAGA GCTGCTGCGC 
CTGCTCCTCG CCCATCCCGA AATCAAGATC GGTGCGCTGG CCGCGAACGC GTCCGCCGGC
CTGCCGGTGA CGGAGGTGCA TCCCCACCTG CCCGACCTGG AGGGCCGGGT GTTCACGGAC
GCGGCCGCGC TGGCCGGGAC CGACGCGGAC ATCGTTTTCC TGGCGCTGCC GCACGGTCAG
TCGGCCGCGG TGGCGGCCAC CCTGCCCGAC ACCGTGCGGG TCGCCGATCT GGGTGCCGAT
CATCGGCTCG TCGACCCGGA GGCGTGGCGG CGCGCCTACG GGGGGGAGCA CGCCGGGACC
TGGACCTACG GCCTGCCCGA GCTCCCCTGG GCACGGGCGG AGATCGCCGC GAGCCGGCGA
GTGGCGATTC CGGGCTGCTA TCCCACGGCG ACCTCTCTCG GGCTCGTGCC GCTGCTGGTC
GGCGGCCTCG TGGAGCCCGC CGACCTGGTC GTTGTCGCGG CGAGCGGCAC GTCCGGCGCG
GGGCGCTCGG CCACGGTGAA CCTGCTCGGC AGCGAGGTGA TGGGTGACCT GACCGCCTAC
AAGGTGGGCA CCCACCAGCA CAGACCCGAG ATCACGCAGA CCCTCTCCCG GGCCGCCGGT
ATGACCGTGA CGGTGTCCTT CACCCCGGTG CTCGCCCCGC TTCCCCGCGG CATCCTCGCG
ACCAGCACCG GCCGGGCCAC CCCGGGCACC GACGCGGACG CCGTGTACGA GACGCTGCGG
GCCGCCTACG CGGGGGAGCC GTTCGTCCGG GTGCTGCCGC CGGGGCGCTG GCCGCACACC
GCGGCGACGC TCGGCGGGAA CGCCGTTCAT GTGCAAGGGA CCTTCGACCC GGAGACCGGC
CGGGCGATCG TCGTCACCGC GATCGACAAC CTCGGCAAGG GCGCGGCCGG CCAGGCGCTG
CAGTGCGCCA ACCTGATGCT CGGCCTGCCC GAGACCGCCG GGCTGACCGC TCAGGGCATC
GCCCCCTGA
 
Protein sequence
MGVTVAVAGA SGYGGGELLR LLLAHPEIKI GALAANASAG LPVTEVHPHL PDLEGRVFTD 
AAALAGTDAD IVFLALPHGQ SAAVAATLPD TVRVADLGAD HRLVDPEAWR RAYGGEHAGT
WTYGLPELPW ARAEIAASRR VAIPGCYPTA TSLGLVPLLV GGLVEPADLV VVAASGTSGA
GRSATVNLLG SEVMGDLTAY KVGTHQHRPE ITQTLSRAAG MTVTVSFTPV LAPLPRGILA
TSTGRATPGT DADAVYETLR AAYAGEPFVR VLPPGRWPHT AATLGGNAVH VQGTFDPETG
RAIVVTAIDN LGKGAAGQAL QCANLMLGLP ETAGLTAQGI AP