Gene Francci3_2452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2452 
Symbol 
ID3905064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2859542 
End bp2860867 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content69% 
IMG OID637879782 
Productaminotransferase, class I and II 
Protein accessionYP_481548 
Protein GI86741148 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.410489 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACAA CCCATTACCC GCGCAAGGAA GACCTGCGCA AGGAAGACCT GCACTCCAGC 
GTCGCCGATC CCGTCATGGA CACGATGAAC TTTCTCAACG AGGTCACCGT GCGTTTCCCC
ACGGCGATCT CGTTCGCTCC TGGACGGCCC TATGACGGCT TCTTCGACAT CGAGCAGATC
TTCACGCAGG TGCGGCGTTA TCTCGATCAC CTGGCCGAGC GGGGCGACTC GCCAGCTCAG
ATCCGCGACG CCCTGTTCCA GTACGGCCCG TCCGCCGGAC AGATCCGGAA GCTGATCGCC
GAATCACTCC GGGTGGACGA GGGCATCGAC GTGCCGCCGG AGTCCATCGT GGTCACGGTC
GGCTGCCAGG AGGCGATGTT CCTGGCGCTG CGGGCGCTCA TGCGGGACCC GGGCGACGTG
CTGGTCGTCT CCAGCCCGTG CTACGTGGGG ATCACCGGCG CGGCCCGGCT GTTGGACATC
GAGGTGACCC CGGTCGAGGA ACGGGCGGAC GGGCTGCACT GCGCCGATCT GGCGGCCGCG
GTCGCGGCCG AGCGGGCGCG CGGGCGCCGG GTCAGGGCCG TCTACGTGGT CCCCGACCAC
TCCAACCCCT CCGGGGTGAC CATGCCGCTG CCGGCCCGGC GGGCGCTGCT CGACCTGGCC
GTACGCCTTG ACCTGCTGAT CCTTGAGGAC AGCCCGTACC GGCTGGTCAG CCCGGGCCCG
CAGGTGCCGA CGCTGAAGTC GCTCGATCCG GCGCGCCGGG TGATCCACCT CGGCTCCTAC
TCCAAGACGG TCTTCCCGGG GGCCCGCGTC GGATTCGCGG TCGCCGACCA GGTGGTGCGG
GACGCCTCCG GAGGGACCGG GCTGCTGGCC GACGACCTCG CCAAGATCAA GAGCATGGTC
ACGGTGAACA CCTCGCCGCT CAGCCAGGCC GCCATCGCCG GGGCCCTGCT CGCCGCCGAC
GGCCGGATCT CCGAGCTGAA CGGCGAGACC TCGGCCTACT ACGGCGACAC CATGCGCGCC
ATGCTGCAGT GCCTGGACAA GCACCTGCCG GCCGAGCGGC GGGCCGAGCG CGGGGTGAGC
TGGAACTCAC CCCGCGGCGG ATTCTTCCTG ACCATGCGGG TGCCGTTCCG TGCCGACAAT
GCCGCGCTGA CCCGTTCGGC GCAGGACTTC GGGGTTATCT GGACGCCGAT GTCCTACTTC
TATCCGAAGG GCGGCGGCGA CCACAGCATC CGGCTGTCCA CCAGCTACCT GACCCGCTTC
GACATCGAGG AGGGCATCGC GCGGCTGGTC GGCTTCGTGG AGTCGCAGGC CAACCCGCCG
CGCTGA
 
Protein sequence
METTHYPRKE DLRKEDLHSS VADPVMDTMN FLNEVTVRFP TAISFAPGRP YDGFFDIEQI 
FTQVRRYLDH LAERGDSPAQ IRDALFQYGP SAGQIRKLIA ESLRVDEGID VPPESIVVTV
GCQEAMFLAL RALMRDPGDV LVVSSPCYVG ITGAARLLDI EVTPVEERAD GLHCADLAAA
VAAERARGRR VRAVYVVPDH SNPSGVTMPL PARRALLDLA VRLDLLILED SPYRLVSPGP
QVPTLKSLDP ARRVIHLGSY SKTVFPGARV GFAVADQVVR DASGGTGLLA DDLAKIKSMV
TVNTSPLSQA AIAGALLAAD GRISELNGET SAYYGDTMRA MLQCLDKHLP AERRAERGVS
WNSPRGGFFL TMRVPFRADN AALTRSAQDF GVIWTPMSYF YPKGGGDHSI RLSTSYLTRF
DIEEGIARLV GFVESQANPP R