Gene Francci3_2509 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2509 
Symbol 
ID3904653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2962483 
End bp2964396 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content69% 
IMG OID637879839 
Producthydantoinase/oxoprolinase 
Protein accessionYP_481605 
Protein GI86741205 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.212275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCC TGATCAACAT CGACAACGGT GGCACGCTTA CCGACATCTG CGTCTGGGAT 
GGCGACCAGT TCACCTACAC CAAGTCCCTC ACTACTCCGC ACGACCTGTC GGAGTGCCTC
TTCGACGGGA TCGAGAAGGC CTCGGTCGCC CTCTACGGCG AAGCCAACAC GGAGAAGCTG
CTGCACGCTA CGCAGCACAT CCGATACTCG ACCACCCAGG GCACCAACGC CCTGGTCGAG
CGGCGGGGCC CGATGATCGG CATCCTCACC ACGATGCCGG GCCTGTTCGA GCGGATGCGC
GGCGGTGAAG CCGAGAAGGA CCTCTTCGAC GGCCTGATCG CCGACCGGAT GCTCACCATC
GACATGGGCG CGACCGATGA GGAGATCGAC TTCGAAGTCG TCCAGCGGAT CAACCAGCTC
ACCACCCTGG GCGCCGCCCG GGTGGTGGTG GCTGGTGAGT CGCCGGAACA GGAGCGTCGC
CTACGCAGCG TGTTGCTGCG CAAGTTCCCG CGGCACCTGC TCGGCTCGAT CCCGCTCCTG
TACTCGTGGT CCCTCGCCGG CGACCGGGAC CACCCGCGTC GGGTGTGGTC GTGCGTGCTC
AACTCGTTCC TGCACCCGAC CATGGAGCGG TTCCTGTACG GGGCGGAGCG GCGGCTGAAG
TCGTACAGGG TGCTCAACCC GCTGCTGGTC TACCGCAACG ACGGAGCCTC CTCGCGGGTG
GCCAAGGCGG TGGCGCTGAA GACGTACTCC TCCGGGCCGC GCGGCGGCTT GGAAGGTACG
GCGGCGCTGG CTCGCACGTA CGGCCTCGAT CACACGCTGA TGATGGACAT CGGCGGCACC
ACCACCGACG TGGGGGTCGT CCGCGGCGGC GCGGCGGCGG CCGATGAGCG CGGCACCATC
GAGGGCGTCC CGATCTCCTA CCCGATGAGC AACGTGCACT CGACCGGGGT CGGTGGGTCC
TCGGTGATCT CGGTGGTGGA CGGGCAGATC ATGGTCGGGC CGCGCAGTGT GGGGGCGGCC
CCCGGCCCCG CCTGCTTCGG CTTCGGTGGC AAGGAAGCCA CGATCACCGA CGTCAACCTG
CTGCTCGGCG TCCTCGACGC CAGCACCTAC CTCGACGGCA CCTTCCGGCT CGACGCCGAC
CGTTCGGCCG CGGTCATCAC CGAGACCATC GCCGAGCCGC TGGGCATCAG CCTGGAGGAG
TCGCTGATCC GGATGGAGCG GGCCTACTTC GAGGCGCTGG CACACTCCTT CGCGCACCTG
ATCGAGGAGA ACTCGACCCT CATCGCCTTC GGTGGCGCCG GCCCGATGAG CGCCGTTGGT
GCCGCTCGTC CGGCAGGGGT GAAGAAGGTG CTGATCCCGC GGATGGCGGC GGTCTTCTCC
GCGTTCGGCA TTGCCTTCTC CGACATCGGC AAGACCTACG AGGTCGGCGT GCCGGAGCCG
ACCACGGCAA GCACCGCGGC GACGTACGAC GAGATGCTTG CCCGGGCCAG GCGCGACATT
TTCCAGGAGG GCTACGACCT CGACGACTGC CGCACCGAGG TACTGCTCAC CATCGAGGAG
ACCGACGGGT CGCCGGTGGA GACCAGGCCG TACCAGTCCG GCGACGCCGC GGACTTCCCC
GGGAAGCAGG TCTCCCTGCA ACTGTCGGTG ACGGCCGCGC TGCCGCACCC CGACGTTGCT
CCCGACACCG ACGTGCCCGC GATCCGGGTG ACGAGCAATG AGACCCGCCT GGTCCGTTCC
GCGCCCGACC AGGTCGACAA GGTGCCGGTG TTCGTGCTCG CCGAGATGCC GCCCGGTGGA
AGTGGCGAGG GCCCGGTGAT CGTCGAGGGC CCGTTCTTCA CCGCCCGCGT GCTGCCCGGC
TGGCAGTTCC GGGTCACCGC CTCGGGGGAC CTGCTGCTGA CCGACACCCA CTGA
 
Protein sequence
MDTLINIDNG GTLTDICVWD GDQFTYTKSL TTPHDLSECL FDGIEKASVA LYGEANTEKL 
LHATQHIRYS TTQGTNALVE RRGPMIGILT TMPGLFERMR GGEAEKDLFD GLIADRMLTI
DMGATDEEID FEVVQRINQL TTLGAARVVV AGESPEQERR LRSVLLRKFP RHLLGSIPLL
YSWSLAGDRD HPRRVWSCVL NSFLHPTMER FLYGAERRLK SYRVLNPLLV YRNDGASSRV
AKAVALKTYS SGPRGGLEGT AALARTYGLD HTLMMDIGGT TTDVGVVRGG AAAADERGTI
EGVPISYPMS NVHSTGVGGS SVISVVDGQI MVGPRSVGAA PGPACFGFGG KEATITDVNL
LLGVLDASTY LDGTFRLDAD RSAAVITETI AEPLGISLEE SLIRMERAYF EALAHSFAHL
IEENSTLIAF GGAGPMSAVG AARPAGVKKV LIPRMAAVFS AFGIAFSDIG KTYEVGVPEP
TTASTAATYD EMLARARRDI FQEGYDLDDC RTEVLLTIEE TDGSPVETRP YQSGDAADFP
GKQVSLQLSV TAALPHPDVA PDTDVPAIRV TSNETRLVRS APDQVDKVPV FVLAEMPPGG
SGEGPVIVEG PFFTARVLPG WQFRVTASGD LLLTDTH