Gene Francci3_3173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3173 
Symbol 
ID3903898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3760264 
End bp3761352 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content73% 
IMG OID637880497 
Productacetylglutamate kinase 
Protein accessionYP_482259 
Protein GI86741859 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0548] Acetylglutamate kinase 
TIGRFAM ID[TIGR00761] acetylglutamate kinase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0759623 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.135118 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCC CGACCCGGAC GCCGCCGCCT TCGAACGGTG GGCACGGCAG CACCGGCAGC 
ACCGGCAGCA CCGGCGACGC CGCCCCGGGC GGGGGCACCG GGCGGGGCCC GGCCGCCACC
GCCCGCGGCC ATGCGGCGCT CGCGAAGACC CAGGTCCTCA TCGAGGCGCT CCCGTGGCTG
TCGCGGTTCC AGGGCGCGAC CATCGTCGTC AAGTACGGCG GCAACGCGAT GACGGAGCCG
GCGCTGCGCG AGGCCTTCGC CGCCGACGTC GTGTTCCTGC GCCACTCGGG GCTGCGGGTG
GTCGTCGTGC ACGGCGGCGG TCCGCAGATC ACCGCGCATC TGGAGCGCCT CGGTGTCCCC
TCAACGTTCG TCGGCGGCCT GCGGGTCACC ACCCCACAGA CCATGGACGT CGTGCGGATG
GTCCTGCTCG GCCAGGTCAA TCGGGACGTC GTGGGGCTCG TCAACGACCA CGGCCCGTTC
GCCGTCGGGC TGTCCGGTGA GGACGCCAAC CTCTTCACCG CGCGGCGCCG CCCGGCGATC
GTCGATGGCC GGGAGGTCGA CGTCGGCCTG GTCGGCGACA TCGTCGAGGT CCGACCGGAG
ACGATCAACG CCCTGCTCGG CTCCGGGAAG GTGCCGGTGG TCGCGTCGGT CGCCCGCGGC
GTCGACGGCG GGGTCTACAA CGTCAACGCC GACACCGCCG CCGCCGAACT CGCCGTCGCG
CTCGGGGCTA CGAAGCTCGT CGTCCTGACC GACGTCGAGG GCCTCTACGC GGACTGGCCG
GCGAGCGACG AGGTGATCAG TGAGCTGAGC ATCACCGAGC TCGAACAGCT CCTCCCCTCG
CTCACCGCCG GCATGATTCC CAAGATGGAG GCCTGCCGGC GGGCGGTGCG TGGCGGTGTT
CCGCAGGCGC ACGTGCTCGA CGGACGGGTG CCGCACGCGG TGCTCCTGGA GATCTTCACC
GACGATGGCA TCGGCACCTT GATCATGGCC GAGTCGGGCA CCTCGCCTGA GCCGGGTACG
CCCCCCGCAC CCGCCGCGCG CCCGGCCGGG ATCGTTCCGG CCGGCGAACC GACCGGAGGA
ACGCCATGA
 
Protein sequence
MNAPTRTPPP SNGGHGSTGS TGSTGDAAPG GGTGRGPAAT ARGHAALAKT QVLIEALPWL 
SRFQGATIVV KYGGNAMTEP ALREAFAADV VFLRHSGLRV VVVHGGGPQI TAHLERLGVP
STFVGGLRVT TPQTMDVVRM VLLGQVNRDV VGLVNDHGPF AVGLSGEDAN LFTARRRPAI
VDGREVDVGL VGDIVEVRPE TINALLGSGK VPVVASVARG VDGGVYNVNA DTAAAELAVA
LGATKLVVLT DVEGLYADWP ASDEVISELS ITELEQLLPS LTAGMIPKME ACRRAVRGGV
PQAHVLDGRV PHAVLLEIFT DDGIGTLIMA ESGTSPEPGT PPAPAARPAG IVPAGEPTGG
TP