Gene Francci3_3857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3857 
Symbol 
ID3906625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4621101 
End bp4622402 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content73% 
IMG OID637881183 
ProductN-succinyldiaminopimelate aminotransferase 
Protein accessionYP_482936 
Protein GI86742536 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID[TIGR03539] succinyldiaminopimelate transaminase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.947972 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.24112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTGAGCGGCC CTGGGAGACC CGGCAACATC TCCGGAGTCC CGCGGTCACC GGCCCGGTCC 
CGGGTGCGAC TGCCCGACTT CCCGTGGGAC CAGCTGGTCT CCTTCAAGGA GAAGGCCCGG
CTGCATCCCT ATGGGTTGGT CGACCTCTCG GTGGGTACTC CGGTCGACGC CACGCCGGCC
GTCGTGCAGC AGGCGCTGGC CGGCTCCGCT GACGCTCCGG GGTACCCGCT GACCGCGGGT
ACCCCGGAGC TGCGCGAGGC GGCGGCCGGC TGGCTGGCCC GCCGGCTCGG CGTGCTCGTC
GACCCGGGCG CGGTGCTGCC CGTGCTCGGG ACGAAGGAAC TGGTGGCCCA GCTCCCCGGT
CAGCTCGGGC TCGAACCCGG TGACCGGGTG TGGGTGCCGA CCCCGGCCTA TCCAACCTAC
GAGGTCGGCG CGCTGCTCGC CCGCTGCGAA CCGGTGGCGG GCCCGGCCGA CGGGGTGACC
CTGATCTGGT TGAACTCGCC GGGGAACCCG ACCGGGCGGG TGCTCACGGT CGACGAGATG
CGCGCCGTGG TCACCTGGGC GCGGGAGCGC GGTGTGATCG TCGCCAGCGA CGAGTGCTAC
ATCGAGCTGG GCTGGGAGAG CCGGCCCGTC TCGGTGCTGC ACCCCGACGT GTGCGGCGGC
TCCCACGAGG GACTGCTGGC GGTGCATTCG CTGTCGAAGC GGTCCAACCT CGCGGGCTAC
CGGGCCGGGT TCGTCACCGG TGACCCGGCC CTGGTCGAGG GTCTCCTCGC GGTCCGCAAG
CACGCCGGCT TCATGATGCC GACGCCCGTG CAGGCCGCCA TGGCGGCTGC GTACGCCGAC
GACATGCATG TGGCGGATCA GCGGGCGCGC TACGCCAACC GGCGGGCCGT CCTCGCGGCG
GCGCTCGCGG TCGCGGGTTT CACCATCGAT CACAGCGAGG CCGGCCTCTA CCTGTGGGCA
ACCCGGGGTG AGGAGGCCTG GGCCACGGTG GACGCGCTGG CCGAGGTCGG GATACTCGTC
GCGCCCGGGA CGTTTTACGG GGAGGCTGGC CGGTATCACG TCCGGATCGC CCTGACCGCC
GCGGACTCGC AGGTGGCGAC TGTTCCCGAG CGGATGACGA TGCTGTCGCC GGTCGCCGCC
ACGGGGCAGC CCGGCCATGG CGCTCGGCCC GGCCATGGCG CTCGGCCCGG CCGTGGCGCT
CGGCCTGACC ATGGCGCTCG GCCCGGCCAT GGCGCTCGGC CCGACTACGG TCAGCCGGTC
ACTCAGGGCA GCTATGGGGG TGCCGAGCCG GACATCCGTT AG
 
Protein sequence
MSGPGRPGNI SGVPRSPARS RVRLPDFPWD QLVSFKEKAR LHPYGLVDLS VGTPVDATPA 
VVQQALAGSA DAPGYPLTAG TPELREAAAG WLARRLGVLV DPGAVLPVLG TKELVAQLPG
QLGLEPGDRV WVPTPAYPTY EVGALLARCE PVAGPADGVT LIWLNSPGNP TGRVLTVDEM
RAVVTWARER GVIVASDECY IELGWESRPV SVLHPDVCGG SHEGLLAVHS LSKRSNLAGY
RAGFVTGDPA LVEGLLAVRK HAGFMMPTPV QAAMAAAYAD DMHVADQRAR YANRRAVLAA
ALAVAGFTID HSEAGLYLWA TRGEEAWATV DALAEVGILV APGTFYGEAG RYHVRIALTA
ADSQVATVPE RMTMLSPVAA TGQPGHGARP GHGARPGRGA RPDHGARPGH GARPDYGQPV
TQGSYGGAEP DIR