Gene Francci3_0064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0064 
Symbol 
ID3905399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp80424 
End bp81671 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content68% 
IMG OID637877394 
Productaminotransferase AlaT 
Protein accessionYP_479187 
Protein GI86738787 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTCA CCCAGTCCGA CAAGCTCGCC GACGTCTGCT ACGACGTCCG TGGGCCTGTC 
CTCGACGAAG CGACCCGGCT GGAAGCTGCG GGGTCGCGCA TCCTCAAGCT GAACATCGGC
AATCCCGCGC CGTTCGGCTT CTCCGCACCG CCGGAGGTAC TCGAGGCCGT CGTGGCAAAC
CTTGCGGATG CACAGGGTTA CAGCGACTCC AAGGGACTAC TCGCCGCCCG GGAAGCGGTC
GTGCGCTATC ACCTCCGCAA GGGGGTCACC GGCATCGACC CCGGCGGGGT CTACCTCGGC
AACGGCGTCT CCGAACTGAT CATGATGTCG TTGCAGGCGT TGCTCAACAA CGGCGACGAG
GTGCTGCTCC CCGCGCCCGA CTATCCACTG TGGACGGCCG TGGTCAGCCT GTGCGGTGGC
CGGCCCGTGC ACTACCTCTG CGACGAGTCC GCCGGCTGGG CGCCCGACCT CGACGACATC
GCCGCCAAGG TCACCCCGCG GACACGAGCG ATCGTCGTCA TCAACCCGAA CAACCCGACT
GGTGCCGTCT ACGACCGGCA GGTGCTGGAG AACATCGTCG AGGTCGCCCG CCGCCACCAC
CTGATGCTGC TGTCCGATGA GATCTACGAC CGGATCCTCT ACGAGGACGC CGAGCACATC
GCGACCGCAG CGCTCGCGCC GGACCTGGTC TGCATGACCT TCAACGGGCT GTCGAAGTCC
TATCGGCTGG CCGGGTTCCG GGCCGGGTGG ATGGTGATGT CCGGTCCGCG CGGCCACGCC
TCGAGCTACA TCGAGGGAGT GAACATCCTC GCGAACATGC GCCTGTGCGC CAACGCGCCC
GGGCAGTTCG CCACGGTCGC CGCCCTCACG GAGGACGGCG GCGCAGGGGA CCTCGTCCTG
CCCGGCGGCC GGCTGCGCGA ACAACGAGAC ACGGTCGTGA AGCTCCTCAA CGACATCCCC
GGGGTGTCGT GCGTCCCGCC GCGGGGGGCG CTGTACGCCT TCCCCCGGCT GGACCCCGCC
GTCTACCCGA TCCGGGACGA CGAGCGCTTC GTCCTCGATC TGCTGTTGGC CGAGAAGATC
CTGCTCGTCC AGGGCAGCGG CTTCAACTGG CCGCATCCCG ACCATGTCCG GATCGTGACC
CTGCCCGCGG TGGACGATCT CACGGACGCC ATCGGCCGGA TCGATCGCTT CCTGGCCTCC
TACAAACGCC CCTCCCAACA ACAGTGCCCC TCCCAACGAC GGAACTGA
 
Protein sequence
MEFTQSDKLA DVCYDVRGPV LDEATRLEAA GSRILKLNIG NPAPFGFSAP PEVLEAVVAN 
LADAQGYSDS KGLLAAREAV VRYHLRKGVT GIDPGGVYLG NGVSELIMMS LQALLNNGDE
VLLPAPDYPL WTAVVSLCGG RPVHYLCDES AGWAPDLDDI AAKVTPRTRA IVVINPNNPT
GAVYDRQVLE NIVEVARRHH LMLLSDEIYD RILYEDAEHI ATAALAPDLV CMTFNGLSKS
YRLAGFRAGW MVMSGPRGHA SSYIEGVNIL ANMRLCANAP GQFATVAALT EDGGAGDLVL
PGGRLREQRD TVVKLLNDIP GVSCVPPRGA LYAFPRLDPA VYPIRDDERF VLDLLLAEKI
LLVQGSGFNW PHPDHVRIVT LPAVDDLTDA IGRIDRFLAS YKRPSQQQCP SQRRN