Gene Francci3_1251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1251 
Symbol 
ID3903550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1495355 
End bp1496584 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content70% 
IMG OID637878585 
Productalkane 1-monooxygenase 
Protein accessionYP_480358 
Protein GI86739958 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.10941 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCAC CTGTCCTCAC CGCGCCCGGC GGCCTCGCCT GGCGGGACGG CAAGCGGTAC 
ATGTGGCTCA CCGCGATCAT CGTTCCGTTG CTGGCCATCG TGGGATACGG CCTGTGGCAT
CGGACGGGTG TCGAGCTGCA CTGGTGGATC ACCCCGCTCT TCGTCTTCGT CCTGGTTCCG
CTGCTCGACA TGACCACACC GCCGGATCCG GTCAATCCAC CGGAGGAGAT CCGGGCCGCG
CTGGAGGCCG ACCGTTACTA CCGCCGGTGC ACGTATCTCT ACCCGCCGCT GGCGGGCGTC
GGCCTGCTGC TGGGCGCCTC GGCCTGGACG AACGGCGACC TGTCGTGGTC CGCCCGGATC
GGCGTGGTGG TGTCGGTCGG CACCGTGACC GGGGTGGGCA TCACCACCGC GCACGAGCTC
GGCCACAAGC GGGGGATTTT CGAGCGCTGG TTGGCGAGGT TGATGCTGGC GCCGGCCGCG
TACGGTCACT TCTCCGTAGA GCACAACCGG GGCCATCACG TGCGGGTAGC GACGGCGGCG
GACCCGGCGA GCGCCCGGTT GGGGGAGAGC TTCTGGCGGT TCTGGCCGCG GACCGTCGTC
GGCAGCCTGC GTTCCGCCTG GTCGCTGGAG GCCGCCCGGC TACGGTTGCG CGGCCGACGG
GTCTGGTCGG TGCGCAACGA GGTCGTGCTT GGCTGGTTGC TCACCGCGGT GTTGTTCCTG
GCGCTGGCCG TCGAGTTCGG GCCGGCTGTA CTCGTCTTCC TCGTCGCCCA GGCGGCCTTC
GGGTTCACCT TGTTGGAGGG GGTCAACTAC ATCGAGCACT ACGGCCTGGC CCGGGAGCTG
ACTCCGAGCG GGCGTTATGA GAAGGTCGAC CCCCGGCACA GCTGGAACAG CGACGCCGTG
ATCAGCAATC TGGCGCTTTA CCAGTTGCAG CGACACAGCG ACCACCACGC CAACCCGACC
CGGCGCTACC AGGCACTGCG GTCCTTCGAG GCCTCTCCCC AGCTACCGGC GGGGTACGCC
ACGCTCCTGC TCGCGGCGTA TCTCCCGCCG GTGTGGTTTC GGGTGATGGA CGACCGCGTC
GTCGAGCATT ACGGCGGGGA CGTCAGCCGG GCGAACCTGC ACCCCGCGCG CCGGGCCGCT
TTGCTGACCC GGTACCGGTC GCCGTCCGGG CCGCCGCCGT CCGGTACCGA GGTCGACGGG
TCTCCGTCCC GGGGCGGTGT TCGTGGGTGA
 
Protein sequence
MAAPVLTAPG GLAWRDGKRY MWLTAIIVPL LAIVGYGLWH RTGVELHWWI TPLFVFVLVP 
LLDMTTPPDP VNPPEEIRAA LEADRYYRRC TYLYPPLAGV GLLLGASAWT NGDLSWSARI
GVVVSVGTVT GVGITTAHEL GHKRGIFERW LARLMLAPAA YGHFSVEHNR GHHVRVATAA
DPASARLGES FWRFWPRTVV GSLRSAWSLE AARLRLRGRR VWSVRNEVVL GWLLTAVLFL
ALAVEFGPAV LVFLVAQAAF GFTLLEGVNY IEHYGLAREL TPSGRYEKVD PRHSWNSDAV
ISNLALYQLQ RHSDHHANPT RRYQALRSFE ASPQLPAGYA TLLLAAYLPP VWFRVMDDRV
VEHYGGDVSR ANLHPARRAA LLTRYRSPSG PPPSGTEVDG SPSRGGVRG