Gene Francci3_2506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2506 
Symbol 
ID3904884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2957493 
End bp2959817 
Gene Length2325 bp 
Protein Length774 aa 
Translation table11 
GC content66% 
IMG OID637879836 
Producthydantoinase B/oxoprolinase 
Protein accessionYP_481602 
Protein GI86741202 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.615482 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTCA CGGACGACAG CGCCACGCAG GAGCAGGCCG CCGAGCAGAA CACCCTCACG 
CCCCAGGAGC GGGAGTGGGT CGACCAGTTC ATGGACGAGA CCACCCTCTT CCTCGGTCCC
GACCCGGTGA TCATGCGTGA CCACTCCATC CAGGAACGGA CGTCGCGGGA GGAGACGGCG
ATCGCCGCCG GTGTCGACCG GCTGGTGGTC GAGCGGATCC GCAAGCGGAT TGCCGGTGCG
CTCGACGAGG GCTACGAGAT GTGCGAGGCG CAGGGTGCCG CGCCCGGGGC GAAGTGGGGT
GACCTGACGA CCGCGATCTA CACCGCGGCC GGGGACGTGT CCTACCTCTC CTGCCACGGG
GTGATCGCGT TCTCCGCGAT CCTGCACCAC CCGATCCGGT ACATCATGAA GTACTGGAAG
GACGAGCCGA CCGTCGGTAT CCACGAGGGC GACGGATTCA TCCACAACGA CGCCCGGTTC
GGCAACGTCC ACAACACCGA CCAGTCGATG ATCATGCCGA TCATACGTGG GGGTGAGATC
ATCGCGTGGG TGGCGGCGAC CATCCACGAA GGCGAGAACG GCGCCTGCGA GCCGGGCGGC
ATGCCGTCGG GCTCGGAGAC CCCGTTCGAC GACGGCCTCC GGATGAGCCC GTTCAAGATC
GTCGAGCGTG GTCATCTGCG CCGGGACCTG CTGACCTTCC TCCAGCACTC GGTGCGCGAC
CCCAAGCTGC AGCTGGCCGA CCTGAAGGTG AAGATCACCG CGGTACGCAA GATCATGGAG
CGCATCGACA AGGTCATCGA CGAGGTCGGC GTCGACACGT TCGTGGCGGC CCTGCGGGTC
ACCGTCGAGG ACGTCGACGC GGAGGTCCGC CGCCGGATCT CCGAGCTTCC CGACGGCACG
TACTCCTTCA ACCAGTTCAT GGACTCGACG CTGAAGGAGA ACATCCTCAT CAAGATCGCC
TGCAGGATCC ACGTCAAGGG CGACAAGATG ACCGTCGACC TGCGTGGCAC CGGACCGGAG
ATCATCAACC GGGCGATCAA CTCTCCGCTG TGCTCGGTGA AGTCGATGAT GATGCAGGCG
ATCCTGGCGT TCTGGTGGCC AGACCTGCCG CGCTGCACGG CGGCGATGAG CTGCATCGAG
ATCATCTCCG ACGAGGGCAC CTGGGCCGAC GCGTCCTACG ACGCCCCGAT GGGACAGTCG
CTGCAGGCCT CGTTCCGAGG CTTCTCGATG ATGCAGGCGC TCTACGGAAA GATGTCGTTC
TCCACGCCGC ACAAGTACTC GAACATCGTG GCCAACTGGT TCAACCAGAT CAACACGTTC
CTGTGGGGCG GTGTCACCCA ACACGGCGAC ATGGTCGGCA ACCTGTGTGC CGACCTCAAC
GGCATGCCCG GGGGAGCCAA GCCCTTCCGG GACGGTGAGG ACGCCGTCTC GCCGCTCTTC
TGCGCCATGG CCGACACGGC CGAGCAGGAG GTCATGGAGG AGGAGGTGCC CTTCATGCAG
CTGGTGGCCA AGCGCCTGGT CCGCGACAAC ATGGGCTTCG GCAAGTTCAC CGGCGGCATG
GGCTACGAGA TGATCGTGGC CGCCGAGGGC ACGCCGGAGT GGGGCTTCAT GACGGTGACC
TCCGGAGCGA AGTTCTCGTC CATCTACGGC ATGTTCGGGG GCTACGGCTG CGGCACCTAC
CCGCTGGCGA TGGTCAAGGG CACGAATGTC TACGAGCACA TTCGTCGGGA CAACAAGAAG
TTCGACCTCT CGATCGAGAA GGTCATGAAC GAGCGTCCGT TCCCGGACGG GAAGTACTCG
ACCTATCACA TGGGTCTGCA GTACGACCGC GCCAAGGACG GCGAGCTCTA CATGATCTCC
CAGGGCGCCG GTGGTGGGTA CGGCGACCCG CTGGAGCGCC TGCCCGAGTC GGTGGTGCGC
GATGCCGAGC TCGGCCGGAT CAGCCAGAAG GTCGCCGAGG AGATCTTCGG TGTCCGCTAT
GACCCGATCA CCTTCCGGCT CGACGCCGAG GGCACCAGGC AGGCCCGCGA GCGGGTCCGC
CAGACGCGCC TGACGCGCGG CAAGCCCTAC GCGGAGTTCG TCAAGGATTT CGTCACCGAG
GAGCCGCCGA AGGACCTCCT CTACTACGGC TCCTGGGGCG ACGACACCAA GGACCTCACC
GCCACGGTGT TCACCATCGA CGGTCCCCAG CGGGTCAAGG CGCCGCTGAA GGAACTGCCG
ATCATCGTGA TTCCGGACCG CCGGGAGCTG AAGATCGCGG CGCTGGAGGC GCGCGTGCGG
GAGCTGGAGG ACAGGCACGG CGAGGACGTC AAGCGTCTCG CCTGA
 
Protein sequence
MTVTDDSATQ EQAAEQNTLT PQEREWVDQF MDETTLFLGP DPVIMRDHSI QERTSREETA 
IAAGVDRLVV ERIRKRIAGA LDEGYEMCEA QGAAPGAKWG DLTTAIYTAA GDVSYLSCHG
VIAFSAILHH PIRYIMKYWK DEPTVGIHEG DGFIHNDARF GNVHNTDQSM IMPIIRGGEI
IAWVAATIHE GENGACEPGG MPSGSETPFD DGLRMSPFKI VERGHLRRDL LTFLQHSVRD
PKLQLADLKV KITAVRKIME RIDKVIDEVG VDTFVAALRV TVEDVDAEVR RRISELPDGT
YSFNQFMDST LKENILIKIA CRIHVKGDKM TVDLRGTGPE IINRAINSPL CSVKSMMMQA
ILAFWWPDLP RCTAAMSCIE IISDEGTWAD ASYDAPMGQS LQASFRGFSM MQALYGKMSF
STPHKYSNIV ANWFNQINTF LWGGVTQHGD MVGNLCADLN GMPGGAKPFR DGEDAVSPLF
CAMADTAEQE VMEEEVPFMQ LVAKRLVRDN MGFGKFTGGM GYEMIVAAEG TPEWGFMTVT
SGAKFSSIYG MFGGYGCGTY PLAMVKGTNV YEHIRRDNKK FDLSIEKVMN ERPFPDGKYS
TYHMGLQYDR AKDGELYMIS QGAGGGYGDP LERLPESVVR DAELGRISQK VAEEIFGVRY
DPITFRLDAE GTRQARERVR QTRLTRGKPY AEFVKDFVTE EPPKDLLYYG SWGDDTKDLT
ATVFTIDGPQ RVKAPLKELP IIVIPDRREL KIAALEARVR ELEDRHGEDV KRLA