Gene Francci3_1821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1821 
Symbol 
ID3906212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2156428 
End bp2159652 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content73% 
IMG OID637879159 
Productlantibiotic dehydratase-like 
Protein accessionYP_480926 
Protein GI86740526 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.525811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACCA TTCGCGACCC AGGCGGGTCC GCTGCACCAC GCTCGGCGAT CCCGCGAGCC 
ACGCCGTCAG AGGTCCGCGA CTCCGACTTG GCGAGCGGCC CGCAGGTCGC CGGGCTCGGC
GCTCGGGCCA GCGACGCGCT GTACAGGGTG GCTGGCCCGG GTGTCGTTCG GGTGGCCGAA
CCGATGCCAG GTCTGGCCGC GCTGCCTTGG CCTGACCTTG GGGACGGGGG CGAGGAGACC
GAGCGGTGGT GCCAGTGGCT GCGCGAGGCT TGGGCGTGTG CGCCGTTCGC CGCAGCCGTG
CAGGCCGCGA GCCCAGTGCT CGCGCGCCGG ATCGCCCAGG TGTGCGCGGG CGGCGATCTA
CCCGCACGGC AGGCACGTAG CGCCGTGCTG TCGCTCATGC GGTACCGGTT ACGGCTGGTG
AGCCGAGCTA CACCGTTCGG CCTGTTCGCC GGGGTCGGCC CGGCCCGGCC CGGTACCGCG
CTCACCGTGT ATGAAGACCC GACGCACCAT CGCATGCTGG CACGGGTGGA CACGGCGTGG
TTGTTCGATG TGCTTGACCA GCTCGAAGGC GTTCCGGGTG TTCTCGACGT CCTGCCCGTC
CGCGCGAACG ATCTCGCCTT CGTTCGTGAT GGTCGGCTGG TACTGCCGTT CGAGGGCCCG
CGAGCCGGGA GCGACGCCGA GGTGTCCACA GCCGAGACCT CGGTCCGATA CACACAGGCT
GTCGCGGCGG TGATGAAGGC TGCGGCCGGA CCCGTTCTCG TCGGGACGTT GGCCGCCCAG
CTCGGCACCG AGTTCCCGGC CGCGCCTGCG CGGGTGCTTC GAGGCATGCT CTCGACACTG
GTCGAGCGGC GGTTCCTGCT GTCCGGGCTT CGGCCACCGT CCACCGAACT CGACCCGCTC
GGCGCGGTCC TCGCGACGCT GCGGGCGATC GGTGCCGACC CGGTACCCGA GGCGGCCGAT
CTCGTGGACG CCCTGCGCCG CCGCTCTCGC GGGCTCGCCG ACATCACGGA ACCGTGGACG
TCACAGCCGG CGACCCCAGA CACCGCAGCC CCGACCTCAA GCCCCTCCGG GCGCCGGCCT
CCCATCGCCC TCGACATGCG GGCCGGCGTC GGGCTGGAAC TGCCTCGGCT GCTCCTCCGA
GAGGTAGAAG CAGCCGTCGA CCTGCTGGTC CGGCTCGCGC CGGCACCCCG GGGCAACGCC
GCCTGGCGCG ACTATCATGA TCGATTTCTT GAGCGCTTTG GTGCCGGCGC GTTCGTGCCG
GTCAACTCCC TGATCAATTG TGAAACCGGG CTCGGCCTGC CCGCCGGATA CCGGGGAACC
ACACTCGGCC CGGCAACGGC CGGCCCGCTC TCGGACCGCG ACGCGCTGCT GCTTGCCCTG
GCGCAGACCG CGGCGGCCCG CCGAGACCGC GAAGTCACCC TTGATGCCGA GTCGCTCGCG
CTGCTCCGCA CGAGCGACGC GGTGGGCTGG ACCTGCCAGC CCCACACGGA GCTGCGGTTT
CGGCTGCACG CAGCAGACCG CCGCGCGGTG GAACGCGGCG CGTTCGACCT CGCTGTCGAG
GGAGTCTCAC GGGCCGCCGG GACAACTCTG GGCCGCTTCC TCGACCTCGC CAACGAGGCC
GACCGCGAGG AGATGACCTC GGCACTTCGC GCGCTGCCGA CCCGCCAGCC GGGGGCACTT
CTCGTCCAGT TGTCGGGTGG CACGCTGTCG GCCACCGCTG CGAACGTCTC CCGCGCGCCG
CGCATCCTGG ACCATCTCAT CGCGCTCGGC GAGTACCAGC CGCCGGACCG CGGGATGATC
CCCGTGACGG ACCTCGCCGT CACCGCAGAC GGTTCGGGTC TGTGGCTGGT CTCGTTGTCC
CTGGGCCGAC CGGTGGAGCC GGTCGCGTTC CACGCCGTCG AACTCACCCG TCACGGACAC
CCCCTGCTCC GGTTCCTCAG CGAGATCGGC ACTTCCCGCG CCGCCCCCTG CGCGCCGTTC
TCCTGGGGCG CGGCCCGACG GCTGCCGTTC CTGCCACGCC TGCGCCACGG GCGGACGATC
CTCGCCCCGG CCCGCTGGCT CCTGGCCCCG GCTGACCTGC CCGGACCGGA CGCCACCTTC
GCCCGGTGGA GCGACGACCT CGCGGCCTGG CGGGACCAAT GGGGCGTTCC CGACCAGGTC
TTCCTCGGCA GCGACGACCG CCGCCTCCTG CTCGATCTGA CCGAGCCCGC CCACCTCCAC
CTGCTGCGCG CAGACTTGGG CCGCGCCACG CGGGCAACCC TGCGGGAAGC ACCGCCGCCT
GACGCAGCCG GATGGATCGG CGGGCGGACC CACGAGATCG TCCTCCCGCT CGCCGCACCT
CCGACCCCGA CGGCGGAGGT ACCGCGCCCA CGGCGCCCGG CGCGGATCGC GACCTCGGCG
GACGCCCATC TACCCGGAGA CGGCGCCTGG CTGTACGCCA AGCTCTACGC CCAGCCCGAC
CGTCAGGTCA CCATCCTCAC CGAACGCCTC GCCAGCCTGT GGGAACACTG GGACACCCCG
CCGCTGTGGT GGTTCCAGCG ATACCAGGAC CCAGCCCCAC ATCTGCGGCT ACGAATCCGC
CTCATCGACC CTGACGGGTT CGGCGACGCG GCCCGGAGAG TGGGCCGCTG GGCGACCGCA
CTGCGCCAAG CCGGCCTACT GGACCAGCTC CAGTTCGACA CCTACCTTCC AGAGACCGGC
CGCTTCGGCG GCGCCTCGAC GCTCGCCGCC ACGGAGACGC TGTTCGCCGC GGACTCCACC
GCCGCACTCG CCCAGCTCGC AGCCTCCGCC CGCGGTGCCA CCCACCCCCA TGCGCTGGTC
GCCGTGAGCC TGCTCGACCT CGCCGCGGGC TGCCTACCCG GAGACGACGC CGCCCGCTGG
CTCGTCGAGC AGCTACCACG CACGCAGGGA CCGCCGATCG ACCGCGCGCA GCACGACGCC
GCCGTTCACC TCGCTGACCC GCGCGAAAGC CAGGCGACGA TGCACACCCT GCCGGGCGGG
AAGGAGATCC TCACCGCCTG GGCGCGCCGC CGGGCGAGGC TCGCCGACTA TCAGACGGCG
CTCACCTCGG CCGGCGACGG ACTCACGACC CGCGGCCTCT TGCCGACGCT GATGCACCTC
CACCAGTTCC GGATGACGGG GCCGTCCGTC CAGGCAGAGC GCGACTGCGC CCGCCTCACC
CGCGCCGTTG CGCTCAGCGT TCTCCGCCGC CGGGAGATGG CATGA
 
Protein sequence
MPTIRDPGGS AAPRSAIPRA TPSEVRDSDL ASGPQVAGLG ARASDALYRV AGPGVVRVAE 
PMPGLAALPW PDLGDGGEET ERWCQWLREA WACAPFAAAV QAASPVLARR IAQVCAGGDL
PARQARSAVL SLMRYRLRLV SRATPFGLFA GVGPARPGTA LTVYEDPTHH RMLARVDTAW
LFDVLDQLEG VPGVLDVLPV RANDLAFVRD GRLVLPFEGP RAGSDAEVST AETSVRYTQA
VAAVMKAAAG PVLVGTLAAQ LGTEFPAAPA RVLRGMLSTL VERRFLLSGL RPPSTELDPL
GAVLATLRAI GADPVPEAAD LVDALRRRSR GLADITEPWT SQPATPDTAA PTSSPSGRRP
PIALDMRAGV GLELPRLLLR EVEAAVDLLV RLAPAPRGNA AWRDYHDRFL ERFGAGAFVP
VNSLINCETG LGLPAGYRGT TLGPATAGPL SDRDALLLAL AQTAAARRDR EVTLDAESLA
LLRTSDAVGW TCQPHTELRF RLHAADRRAV ERGAFDLAVE GVSRAAGTTL GRFLDLANEA
DREEMTSALR ALPTRQPGAL LVQLSGGTLS ATAANVSRAP RILDHLIALG EYQPPDRGMI
PVTDLAVTAD GSGLWLVSLS LGRPVEPVAF HAVELTRHGH PLLRFLSEIG TSRAAPCAPF
SWGAARRLPF LPRLRHGRTI LAPARWLLAP ADLPGPDATF ARWSDDLAAW RDQWGVPDQV
FLGSDDRRLL LDLTEPAHLH LLRADLGRAT RATLREAPPP DAAGWIGGRT HEIVLPLAAP
PTPTAEVPRP RRPARIATSA DAHLPGDGAW LYAKLYAQPD RQVTILTERL ASLWEHWDTP
PLWWFQRYQD PAPHLRLRIR LIDPDGFGDA ARRVGRWATA LRQAGLLDQL QFDTYLPETG
RFGGASTLAA TETLFAADST AALAQLAASA RGATHPHALV AVSLLDLAAG CLPGDDAARW
LVEQLPRTQG PPIDRAQHDA AVHLADPRES QATMHTLPGG KEILTAWARR RARLADYQTA
LTSAGDGLTT RGLLPTLMHL HQFRMTGPSV QAERDCARLT RAVALSVLRR REMA