Gene Franean1_3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3901 
Symbol 
ID5672262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4665120 
End bp4666745 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content69% 
IMG OID641242780 
Productcholesterol oxidase 
Protein accessionYP_001508197 
Protein GI158315689 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0511249 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAA ACACGCCCGA CGCCGTCGAA TCCACCCCCG GCCTCAACCG CAGGCACTTA 
ATCGGCACGG CGGCGATCGG CGCCGCCGCG GTCGGCGGTT TCACCGCGGG ACCGGGTGCC
GTCCGGGCCC GGGCCGCCAC CCCCGTGCCG GCCACTGTGC AGCAGGAGCG GGCCGTTGTC
ATCGGGAGCG GCTTCGGCGG CGGCGTGACC GCGCTGCGGC TGGCCCAGGT AGGCGTGTCG
ACGCTGGTCC TGGAGCGCGG GCTGCGGTGG CCCACCGGGC CGAACGCAAC GACGTTCTGC
CGCTTCGCCA ACATCGACAA CCGCTCCGCC TGGCTGACCG ACCACGCCAC GGTCGGCGGC
GTGGTGAAGA CGTGGGAGCC CTACACCGGG GTGATCGAGA GCATTCCCGG CAACGGAATC
ACGGTGAACT GCGGGGCGGC CGTCGGCGGC GGCTCCCTCA TGTACCACGG CATGACACTG
AAGCCCTCGA AGGCGAACTT CGCCGCGTCG ATCCCGGTGG CCGCGAACCT CTACGACGAG
CTGAACCTGT GGGCGTATCC GCTGGTGGCC AGCATGCTGG GAGTCTCCAC GATTCCCAAC
GACATCCTGA ACAGCGACCC GTACAAGTCC TCCCGGCTCT TCCGGGACGT CGCTCCGGGC
GCGGGCCTGG AGCCGTTCCA GGTTCCCCTG CCGATCGACT GGCAGTACGT GCGCGGTGAG
CTGAACGGCC AGTACCAGCC GACCTACACC ACCAGCGACA TCGCCTTCGG GGTCAACAAC
GGCGGTAAGC GCTCCATCGA CGTCACCTAC CTCAAGGCGG CGGAGGCGAC CGGCCGGGTC
CGCGTCGCGA CGCTGCACGT CGTCCGCGAC ATCGCGCTCG ACGCGAACAA GAAGTGGGTA
CTGACGGTCG ACCGCATCAA CACCGGCGGC ATCGTCCAGG AGACGAAGAC CATCGTCGCG
GACGCGGTAT TCCTCAACGC GGGTTCCGCC GGCACGACCC GCCTGCTGGT GAAGTCCAAG
GCGAAGGGGC TTATCCCCAA CCTGCCGGAC GCCGTCGGAA CCCAGTGGGG AAACAACGGC
GACCGCATCT ACCTGTGGAA CGGCATGAAC GGCGACATCG GGACCCAGCA GGGTGGTCCC
GCCTGCGTCG GCGGTCGCGA CACCACCAGC TCGATCCCGC TCACCATCAT CCACGCGGGC
TCCCCCATTC CGAGTACCGC CGGCAAGCTG ATGACGGTCG TCGGCTTCGG GATCGTGAAC
CCCGCCGGCA CCTGGGCGTA CGACTCCGCG AAGGACGACG CCGTCCTGAC CTGGCCGTCC
AGCGGTGACG CCGCGCTGCA GGCGCTGATC GCCGCGCGCA TGCAGAAGAT CGCCCAGGTG
GGCGGCGGCA TCATGATCGA CACGAACGCC CAGGCGAACT CGACCTGGCA CGCCCTGGGC
GGCGTGCCCA TGGGGTCCGC GGTCGACCTC TACGGCCGGG TCATCGGCCA GAGTGGCCTC
TACGTGCTCG ACGGCGCGCG GATCCCGGGC TCCACCGGCG CCTGCAACCC GTCCATGACG
ATCGCGGCCC TCGCCGAGCA CAGCATGGCC AAGATCGTGC TCCAGGACGT CGGACGCGTC
TTCTAG
 
Protein sequence
MPENTPDAVE STPGLNRRHL IGTAAIGAAA VGGFTAGPGA VRARAATPVP ATVQQERAVV 
IGSGFGGGVT ALRLAQVGVS TLVLERGLRW PTGPNATTFC RFANIDNRSA WLTDHATVGG
VVKTWEPYTG VIESIPGNGI TVNCGAAVGG GSLMYHGMTL KPSKANFAAS IPVAANLYDE
LNLWAYPLVA SMLGVSTIPN DILNSDPYKS SRLFRDVAPG AGLEPFQVPL PIDWQYVRGE
LNGQYQPTYT TSDIAFGVNN GGKRSIDVTY LKAAEATGRV RVATLHVVRD IALDANKKWV
LTVDRINTGG IVQETKTIVA DAVFLNAGSA GTTRLLVKSK AKGLIPNLPD AVGTQWGNNG
DRIYLWNGMN GDIGTQQGGP ACVGGRDTTS SIPLTIIHAG SPIPSTAGKL MTVVGFGIVN
PAGTWAYDSA KDDAVLTWPS SGDAALQALI AARMQKIAQV GGGIMIDTNA QANSTWHALG
GVPMGSAVDL YGRVIGQSGL YVLDGARIPG STGACNPSMT IAALAEHSMA KIVLQDVGRV
F