Gene Francci3_3315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3315 
Symbol 
ID3904101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3927165 
End bp3930248 
Gene Length3084 bp 
Protein Length1027 aa 
Translation table11 
GC content69% 
IMG OID637880640 
Productlantibiotic dehydratase-like 
Protein accessionYP_482401 
Protein GI86742001 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.301651 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTGA GATCTCCAGC GATGTACCAG TGGGCCGGTG CAGCACTACT ACGTGCGAGC 
ACCGATCCGG GCGGATTGGA CTTGCCAGCG GACCTGGACC TGTTCGGTGC CGACGCCGCG
GAAGAAGGGT CGGCGTGGCT GTCGGCGATG TGGCGGCGCG AGGAAATTCG CGCGGCGATC
GCTCAGGCGA GTCCCGCGCT GATTCAGCAG GTTGACACCG TTCTGACCTC CAGTGGTCAT
GACGTGCGAG TGGTTCGCCG GACCGTGCTT TCCGTAGCGT CCTATCTGCT TCGGTGGCAG
CGCCGTCCCA CTCCCTTTGG GCTGTTCGCC GGAGTCGCGC TGGCCCGGAT CGATGCTGGG
GCGAAGGTGC GGTGGGGTCG CGATCACCGG GTCGAGGCCC GGGTTGACGC GGGCTGGCTC
GGTGATGTCC TCGCGCGCCT GCAACGGTGT CCGACGCTGC GGGAGCGGCT GTCGCTGGTC
GCCAATGGCG CCGGGTTGGT GCGCGGTGAC CGCTTCGGAG CGCCCGCGCC GACGCCGGAT
GGCATAGCGG ACGAGTTGGC GCCGATCGAG GTGTCCGTGC GCCACAGTCG ACCGGTCTGC
GCCGCGCTGG AGGCCACGCG GAAACCGGTC ACGTTCAGCG AGCTACGAAC ACTGCTCATG
GAGCGCTTCC CCAGTGCCCC CGCGCAGCGG ATCGACGAGA TGCTCACGGG TCTGCTCGAC
CAAGGAATCC TGCTGAGCAA TCTGTCAGCG CCGATGACCT GCCTGGATGC ACTCGGTCAT
GCGTGCGCTC AGCTGGAGGC CGTTGACGCC CACAGCATTC CGGAGGTTAG CGATCTTGTC
CGTTCGATGT TCGAGATCCA CAAGGAGGTG TCGGCCACCA GCCAGGTTCT CGGGTCAAGG
TCGGCCGTGA CCGAGCAGAT GCACGCGCTG AGCGAGGCCG CCGAGGTACC CATGATCGTC
GACACGATCC TGGAGTGCGA CGTTCACATA CCGGACCAGG TCGCCCAGGA AGCCCGCAAC
GCCGTCCAGG TCCTCTATCG ACTCTCACCG TATCCGTTGG GTTATCCCGC CTGGCGGGAC
TACCACTCCC GGTTCCGGAC CCGCTACGGG ACGGGCGCCT TCGTGCCGGT CATGGACCTG
ATCTCCGACA GTGGCCTGGG AGTTCCGGCC GACTATCTGG GCTCGGCGCG CAGGCGTGCC
GCTCGGCAGG TGAGTGAACG TGACGAGAAA CTACTGGCGC TGATCCAACG GGCCACGCTG
TCCGGCGGCG GCGAAATCGT CCTGACTGAT CAGATGATCG AGGAGCTTGC GGTCAGCGAT
CCGGCCGACG TGCACCTGCC TGCTCGGGTC GAGGTGGCCG TGGAGATCCG CTCCATGTCC
GTTGAGGCGC TGGCCCGCGG CCGGTTCACG GTGGCGGTGA CCGGCACGCC ACGGCCCGGC
AGCAGCATGG CTGGCCGCTA CGCCCACCTG CTGCCAGCGG ACGGCCGCGA CCTGATCGCG
GGCACCTTCG CTGCGGCCGG CACCGACGCG ATCCCCGCGC AGCTCTCCTT CGCCCCGCGT
AAGCGGCGCA ACGAGAACGT CGCGCGCACG CAGCAGCTCC TGACACATGT GATCCCCGTG
GCCGAATACC GCGACGGCGA CGAACGCCTG ATCCCCCTGA CGGACCTCGC GGTCAGCGTG
GACGACCGCC GCTTCTACCT CGCCCAGATC TCCACCGGCC GGTACGTCGA ACCGCGGGTC
GCCCACGCCC TGGAGGCCGG CGTGCACACC CCGCCGCTCG CGAGGTTCCT CGCTGAGATC
ACCACCGCCC GAGCCGCCGT GTACAAGGCA TTCCACTTCG GCGCGGCGGC ACAGCTTCCC
TACCTTCCAC GCGTTCGATA CCGGCGCACC GTGCTGTCTC CGGCACGGTG GCTGCTAGCG
GCCGGTGAAC TTCCCGGCCG CGGCGCCTCG ACGGCCGAGT GGGACGCCGC GCTGGAAGAC
TGGTGCAGCC GGTGGTGGGT TCCCGGCCAT GTCGCGATGG TGGAGCACGA CCGGCGGCAG
CCGGTAGACC TCGGCCACCC GCTTCACCGT CTCCTGCTGC GCACCCGGCT GGAACGCGCT
GACCGCCTGG AACTGCGCGA GACGTCGACC CTGGAAGACG TGGCCTGGCT GGGGCGTGCC
CACGAGGTGC TGATCCCCAT GGTCTTGGAC CCGCAGCCCG CCACAGATCC CGGGCCAGGC
ATCAGCACAC GGCGAGTCGT GGCCGTCGAC GCCGGGCATC TCCCCGGCGA GTCCACGGTC
GTGTCCGCGC ACCTGTACGG GCATCCGGCG CGCGTCGAGG AACTCCTGAC GCAACACCTT
CCCCACATGA TCGACGCCTT CGGCGTCCAC AGGCCGCGCT GGTGGTTTCG GCGGAACCGC
GAAATGCGCA GACCAGAGAT CGACCAGTAC CTCGCCGTAT ACCTCTGGCT ATCGGAGCCC
TCCGCATACG GCCCTGCCGC CGCATGCCTT GCCCGGTGGG CCGACGATCT GCGCCGACAA
CACCTGCTCG CGCACGTCTC GCTCACCACC TATGACCCCC AGTCGGGACG TTACGGACAC
AGCCCAGCCC TGGACCACGT CCAGGACGTC TTCGCCGCCG ACTCGGCCTG CGCCATCGCC
CAGATCAGCG CATCCATCCG CGCAGGCGTG CATCCCCAGG CCCTGGCCGC TGCCAGCCTG
GTCGACCTGG CAGTGAGCTA CGCCGGGTCC CCACAAGACG GGCTGGACTG GCTGATCCGC
GAACTCCGCC AAGAACACGG AAGGCTGGAC CCCGCGCTAC GGCAACAGAC ACTCGAACTA
GCCGACCCGC ACGGCAGTTG GACGCGGCTG CAATCCCTGC CCGGCGGACG CGATGTCCTG
GCTGCCTGGG GCACCCGCGC CAGTGCGCTG GCGGCGTACC GAGATGCCCT CGCTGACCAA
CGCGACCCGA TGCCGGTCCT GCGATCGCTC CTGCACCTGC ACCACAATCG CGCTGTCGGT
GTCGACCCGG CTGTCGAACG AGCCACCGGC CGGCTCGCAC GGGCCTGCGC GCTGCGCCAC
ACCGCCCACC GCACGGAGAC ATGA
 
Protein sequence
MAVRSPAMYQ WAGAALLRAS TDPGGLDLPA DLDLFGADAA EEGSAWLSAM WRREEIRAAI 
AQASPALIQQ VDTVLTSSGH DVRVVRRTVL SVASYLLRWQ RRPTPFGLFA GVALARIDAG
AKVRWGRDHR VEARVDAGWL GDVLARLQRC PTLRERLSLV ANGAGLVRGD RFGAPAPTPD
GIADELAPIE VSVRHSRPVC AALEATRKPV TFSELRTLLM ERFPSAPAQR IDEMLTGLLD
QGILLSNLSA PMTCLDALGH ACAQLEAVDA HSIPEVSDLV RSMFEIHKEV SATSQVLGSR
SAVTEQMHAL SEAAEVPMIV DTILECDVHI PDQVAQEARN AVQVLYRLSP YPLGYPAWRD
YHSRFRTRYG TGAFVPVMDL ISDSGLGVPA DYLGSARRRA ARQVSERDEK LLALIQRATL
SGGGEIVLTD QMIEELAVSD PADVHLPARV EVAVEIRSMS VEALARGRFT VAVTGTPRPG
SSMAGRYAHL LPADGRDLIA GTFAAAGTDA IPAQLSFAPR KRRNENVART QQLLTHVIPV
AEYRDGDERL IPLTDLAVSV DDRRFYLAQI STGRYVEPRV AHALEAGVHT PPLARFLAEI
TTARAAVYKA FHFGAAAQLP YLPRVRYRRT VLSPARWLLA AGELPGRGAS TAEWDAALED
WCSRWWVPGH VAMVEHDRRQ PVDLGHPLHR LLLRTRLERA DRLELRETST LEDVAWLGRA
HEVLIPMVLD PQPATDPGPG ISTRRVVAVD AGHLPGESTV VSAHLYGHPA RVEELLTQHL
PHMIDAFGVH RPRWWFRRNR EMRRPEIDQY LAVYLWLSEP SAYGPAAACL ARWADDLRRQ
HLLAHVSLTT YDPQSGRYGH SPALDHVQDV FAADSACAIA QISASIRAGV HPQALAAASL
VDLAVSYAGS PQDGLDWLIR ELRQEHGRLD PALRQQTLEL ADPHGSWTRL QSLPGGRDVL
AAWGTRASAL AAYRDALADQ RDPMPVLRSL LHLHHNRAVG VDPAVERATG RLARACALRH
TAHRTET