Gene Francci3_3671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3671 
Symbol 
ID3905355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4397966 
End bp4399807 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content70% 
IMG OID637880997 
Producthypothetical protein 
Protein accessionYP_482752 
Protein GI86742352 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0687047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCGCC GTCGCGCGCC GGGTGATCCG ACGGGAGCAT GTCACGCCAA CACCGAAAGG 
CAGCGCGCGA GCATGGCCTC GATCATCCGC GGCCGACCCG CCCCTCCGGT CGGCCAGCCG
ATTCCCTCAG CCCATCGACG ACCCCGCAGC GGCCGGATCC ACACCCAGCG CCGGGTCCGG
ACCGGAGCCC GGATCGGCCT GATAGCCGTG TTCGTGGTGG GTCTCCTCCA GGCGGTCACC
CCGCTGGCCC GCGCCGCGGG AAAGTCACCG TCGCCGACTC CGGCCGTGTC GCCATCCGCG
ACGAGCGGAT CCAGCCGGGC GGGGGGCGAC GCGGGCAGTG GCCGCGCGCA GGGCGTTCCA
CTCACCACGC TGAGCCCGCA GGCGCGCAGC CGGATCACCC TCGTCCTTTT CGACCAGGAC
ATCGTGCTCG TCGGTGGCCG CAGTTTTACC GCGGGTGATG ATGTCACCGT CGTGGCGGAG
ACGGCCAAGC TGGGCGGCAG CGCGCTCACC CGGGCGGACG CCGACGGCCG TTTCATCCTC
GGCTTCCGGG TGCCGGCGGA CTTCGCGGGA ACCGTCAACG TCAAGGCCAC CCAGGGCACG
GCCACGGCGA CCGACGCTCT TGATGTGACC GGACCGGAGG CGGCCGCCCG GGCGTTCCAG
GCCGCTCCGG CTCTTCCCGA TGAACCGGTG GTGAAGACGG CGCCCTCGAC ATCGCCGTCT
CCCAGGCAGC AGGACACGTC CGCGGGCACC TCGCCGAGCC CGGCCGTCTC GCCCGCGCCG
TCCCCGTCCG CCGCCGGTTC GGCGTCCGCG TCGCCGTCCC CCGCGCCGTC CGCGAGCGCC
AGCTCGACCG CCCTCACCAG GAGAACCGCA ACGCCGACCG CAACGCCGAC CGCGACCCCG
TCCGGGTCGA GACCGACCCC CGGTACGGGA TCGACGGGCA CCGGCGCCGG TGGGACGAGG
TCGGGCCTGC CGTGGCTGTC GGGGTTCCAG TCGCATCAGC TCGCGCAGCT CGTCGAGTTC
GGTGACTGGC GCGGCCGGCC GAACGACATC GTCCACGTCT ACACGAGCCG CGACCAGGGT
TGGGGCGGGC TGGTGGAGCC GGCCTGGCCG GTGGATCTCT TCAAGGGTTT CCAGGGCAAG
CTGCTGATCA GCCAGCCGAC ATTCCCCAAG GGCCAGGGCG ACAACGCCGC CTGTGCCCGA
GGCGAGTACG ACTCCGAATG GAAGAAGCTC GGCAGCTTCC TCGTGGCGCA CGGCCGGGCG
GACTCCATCA TCCGGATCGG CTGGGAGTTC AACGGAACGT TCATGTACTG GCATACGGAC
GCCGACCCGA CGGTGTTCCG GGACTGTTTC CGGAAAATCG CGACCGCCAT CCGGTCCACC
GACTCCGAAG TCAAGATCGA CTGGACCTTC AACGCGCACG CCTCCCCGGT ACCCAGCGGC
AACAGTCCCT GGGGCGCCTA CCCCGGCGAC GAGTACGTGG ACTATGTCGG CATCGACGCC
TACGACCACT TCCCCCCGTC GAAGGACGAG GCGACGTGGA ACAAGCAGTG CGAGGACGTC
AACGGCCTGT GCTACGTGAT CAGGTTCGCC CGGGAACACG GGAAGAAGGT CGGCGTCGGG
GAATGGGGGG TGGCCTCCTG CAGCGGTGAC GGCGGTGGCG ACAATCCCTT CTACATCCAG
AAGATGTTCG ACACCTTCAA GGCGGCCAGC GACGTGATGG GATACGACGC CTACTTCAGC
GACCCCACTC CCGGAAACGT CTGCTCGACC ATCACGAACG GCGGTCAGAA CCCGAAGGCG
GCCGCCATGT ACAAGAAGCT CTTCGGATCC GGCGCCAGTT AG
 
Protein sequence
MLRRRAPGDP TGACHANTER QRASMASIIR GRPAPPVGQP IPSAHRRPRS GRIHTQRRVR 
TGARIGLIAV FVVGLLQAVT PLARAAGKSP SPTPAVSPSA TSGSSRAGGD AGSGRAQGVP
LTTLSPQARS RITLVLFDQD IVLVGGRSFT AGDDVTVVAE TAKLGGSALT RADADGRFIL
GFRVPADFAG TVNVKATQGT ATATDALDVT GPEAAARAFQ AAPALPDEPV VKTAPSTSPS
PRQQDTSAGT SPSPAVSPAP SPSAAGSASA SPSPAPSASA SSTALTRRTA TPTATPTATP
SGSRPTPGTG STGTGAGGTR SGLPWLSGFQ SHQLAQLVEF GDWRGRPNDI VHVYTSRDQG
WGGLVEPAWP VDLFKGFQGK LLISQPTFPK GQGDNAACAR GEYDSEWKKL GSFLVAHGRA
DSIIRIGWEF NGTFMYWHTD ADPTVFRDCF RKIATAIRST DSEVKIDWTF NAHASPVPSG
NSPWGAYPGD EYVDYVGIDA YDHFPPSKDE ATWNKQCEDV NGLCYVIRFA REHGKKVGVG
EWGVASCSGD GGGDNPFYIQ KMFDTFKAAS DVMGYDAYFS DPTPGNVCST ITNGGQNPKA
AAMYKKLFGS GAS