Gene Francci3_2807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2807 
Symbol 
ID3904953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3305572 
End bp3306774 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content74% 
IMG OID637880128 
Productaminotransferase, class I and II 
Protein accessionYP_481894 
Protein GI86741494 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACG CCGCTCCACC CGCCGCTGCG GTGCCCGGTC TCGCCGCCCG AGCCGGCGCC 
GTGCGCACCT CGCCAGTGCG GGAGATCCTC GCGCTGACCG CCCACCCGGA GGTGATCTCC
TTCGCCGGCG GTCTACCCGC TCCCGAGCTC TTCGACGCCG ACGGAATCCG GCGCGCCTAC
GACCGGGTGC TCACCGAGGA GCCGCGGCGC GTGCTGCAGT ACTCCGCCAC GGAGGGCGAC
CCGGACCTGC GCCAGACGGT CGCGGAACGC CTGGCGCGCC GCGGCCTGCC CACCCGAGCG
GACGACCTGC TGATCACGAC GGGCTCGCAG CAGGCCCTGA CGCTGCTGTC GGCGACGCTC
GTGGAGCCGG GTGACGCGGT CCTCGTGGAG GATCCGACCT ATCTGGCTGC CCTGCAGTGC
CTCGGCCTCG CGGGTGCCCG GGTGGTGCCC GTCCCCACCG ACGAGCACGG GATCCTCCCC
GACCGGTTGG CGGAGACGGT GGCCCGCGAG CGTCCCAGAC TGCTCTACCT GGTGCCGACC
TTCCAGAACC CCACCGGTCG GACGCTGTCC GCGGCGCGCC GCGCGGCCGT CGCCCGGGTG
GCCGCGGAGC AGGGCCTGTG GATCGTCGAG GACGACCCCT ACGGTGAGCT GCGCTTCCAG
GGCACGCGGG AGCCGTGGAT CGCGTCGTTC GCCGACGCCG CCGACCGCAC CGTCCTGCTC
GGCAGCTTCT CCAAGATCAT AGCCCCAGGG ATGCGACTGG GCTGGCTGCG GGCCCCGGCC
ATGCTGCGGC GGGCCTGCGT GATCGCCAAG CAGGCCGCGG ACCTGCACAC CTCCACGGTC
GATCAGGCCG CCGCCGCCCG GTATCTGGCC GACGCCGACC CCGACCTGCA CATTGCTCGG
ATGTGCGCCG CCTACCGCGA ACGCCGCGAC GCGCTCCTCG ACGGGCTCGC CTGGGCGCTG
CCGCCCGGGA GCAGCTGGAA CCGGCCCGAG GGCGGCATGT TCGTCTGGGC CCGCCTCCCC
GATGGTCACG ACGCCACCGC CCTGCTGCCC GGCGTGGTGG AGCACGACGT GGCCTACGTT
CCCGGTGCTC CTTTTTTCGC GGGGCCGCCG GACCCCGCGA CGCTGCGCCT GTCCTTCACC
ACGCACCCGC CGGTGGAGAT CACGAAGGGG TTGACGCGGC TGGCACGGGC GTTCGGAGTA
TGA
 
Protein sequence
MTNAAPPAAA VPGLAARAGA VRTSPVREIL ALTAHPEVIS FAGGLPAPEL FDADGIRRAY 
DRVLTEEPRR VLQYSATEGD PDLRQTVAER LARRGLPTRA DDLLITTGSQ QALTLLSATL
VEPGDAVLVE DPTYLAALQC LGLAGARVVP VPTDEHGILP DRLAETVARE RPRLLYLVPT
FQNPTGRTLS AARRAAVARV AAEQGLWIVE DDPYGELRFQ GTREPWIASF ADAADRTVLL
GSFSKIIAPG MRLGWLRAPA MLRRACVIAK QAADLHTSTV DQAAAARYLA DADPDLHIAR
MCAAYRERRD ALLDGLAWAL PPGSSWNRPE GGMFVWARLP DGHDATALLP GVVEHDVAYV
PGAPFFAGPP DPATLRLSFT THPPVEITKG LTRLARAFGV