Gene Francci3_3026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3026 
Symbol 
ID3904379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3591193 
End bp3592386 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content73% 
IMG OID637880346 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_482112 
Protein GI86741712 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.992363 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0418598 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATTCA AGGCGGCCGG GAGCGGTGCC GAGGACCGGC CGTGGCGCCC GTGGGACGCT 
GACGGCCTGC CGCTGCGCGA CAGCCTGCGC GGGCTCTCGC CCTACGGCGC TCCGCAGCTC
GACGTGCCGG TGCGGCTCAA CACCAACGAG AACCCGCACC CGCCGTCGGT CGGGCTGGTC
GACGCCATCG GCAAGGCCGC GGCCCTCGCC GCCACCGAGG CGAACCGCTA TCCCGACCGG
GACGCCGAGG CGCTGCGTGC CGACCTCGCC TACTACCTGA CACCGGACGC CGGCTTCGGC
GTGCACACCA GTCAGGTCTG GGCCGCCAAC GGCTCCAACG AGATCCTCCA GCAGCTCCTC
CAGGCGTTCG GCGGGCCCGG TCGGGTGGCG CTCGGGTTCG AGCCGTCCTA CTCGATGCAC
CGCCTCATCG CGCTGGCCAC CGCCACCGAG TGGGTGGCCG GGCAGCGAGC GGAGGACTTC
ACGCTCTCCC CGGCCGTGGT CACCGACGCG ATCGCCCGGC ACCGACCCGC CCTGGTCTTC
CTCTGCTCAC CGAACAATCC GACGGGCACG GCCCTGCCGC CCGAGGTCGT CGCCGCCGCC
TGTGAGGCGG TGGAGGCCAC CGGCAGCGGC ATGGTCGTCG TCGACGAGGC CTACGCCGAG
TTCCGCCGGG CCGGCGTGCC CAGCACGCTG ACGCTGCTGC CCCGTCATCC GCGGCTCGTC
GTCACCCGGA CAATGAGCAA GGCGTTCGCG CTGGCCGGTG CCCGGGTAGG CTATCTTGCC
GCGCACCCCG CCGTGGTGGA CTCGCTGTAC CTGGTGCGGC TGCCCTACCA CCTGTCGAGC
TTCACCCAGG CCGTCGCGCG CACGGCGCTC GCGCACGCCG ACGAGCTGCT CGGCACGGTG
GAGGCGGTCA AGGCCCAGCG CGACCGCATC GTCCGGGAGT TGCCGGCGCT CGGTCTGCGG
CTGGCTCCGA GCGACGCCAA CTTCGTGTTC TTCGGCCGGT TCGCCGATCA GCGGGCGGTG
TGGCAGAGCC TGTTGGACGC GGGAGTGTTG GTCCGCGATG TCGGTCTGAC CGGCTGGCTG
CGGGTCACCG CCGGGCTGCC GAACGAGGTG GACGCGTTTC TCGGGGCGCT GGGCAGAACT
CTCACCGGCA GCGTCATCGG CGCGGACGGT GTCATCAGCC TCGCCACCGC CTGA
 
Protein sequence
MIFKAAGSGA EDRPWRPWDA DGLPLRDSLR GLSPYGAPQL DVPVRLNTNE NPHPPSVGLV 
DAIGKAAALA ATEANRYPDR DAEALRADLA YYLTPDAGFG VHTSQVWAAN GSNEILQQLL
QAFGGPGRVA LGFEPSYSMH RLIALATATE WVAGQRAEDF TLSPAVVTDA IARHRPALVF
LCSPNNPTGT ALPPEVVAAA CEAVEATGSG MVVVDEAYAE FRRAGVPSTL TLLPRHPRLV
VTRTMSKAFA LAGARVGYLA AHPAVVDSLY LVRLPYHLSS FTQAVARTAL AHADELLGTV
EAVKAQRDRI VRELPALGLR LAPSDANFVF FGRFADQRAV WQSLLDAGVL VRDVGLTGWL
RVTAGLPNEV DAFLGALGRT LTGSVIGADG VISLATA