Gene Francci3_1539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1539 
Symbol 
ID3904771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1849264 
End bp1850724 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content75% 
IMG OID637878876 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_480644 
Protein GI86740244 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.407284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCATG CCGCTGACCC GTTTCGTCGG CGTCGCCGTG AGCCCGGTCT GTCCCGGCGC 
GGAATTCTCG GTCTCGCCGG TGCCGGTCTG GGTGGAGGGG CGGAGGCGCT GCTCGCCGCC
GCGCCGAGCA GCGCGTCCGC TGGCCCGGCC GGGCCAGCGG ACGCGGGGGC CCGACCCGGT
GCGGAAGCGC GGGCCCTCTC GGCGCCGGGG ACGCGTGTCG TGCTCGTGCC CCGGACCCTG
GCCGCCGTGA CGAGGGGAGC GGGGCCCGCG ACGTCGGATT TCGACATCGG TTACGTGGCG
GTCCGATGGA CACGTGACGG GAACGATCGC GCGGCGGATG GCAGGGCCGG TTCCGGGGGT
GCCGGCACGG GGACCGCCGG GACCGCCGAT GCCGGCGGTG CCGTTTCTCG GGGCGCCGAG
ATCCGGCTGC GCCAGCCGGG GGGTGGGTTC GACGCATGGC GGCCGCTGGC GGTCGGCTGT
CCGGCGGAGC GGGACGATTC GCCGGCCGGG GGGAGGAGCG CCGCCGTGCT GGTCGCCGCG
CACCGCGCGA CCGGCTACGA ACTGTGGTTG CCACCCGGCG CGACGTCCGT GACGTCCACC
GCCCTGAACA CGACGTCGGG GCCGTTGCAG CGGTTGACGG CCCCCATTGT GTCCGCCTAT
GCCGCCCTTT CCGCGATCTC GGCTTCCCGC GCCGCCCTTT CCGCGCCTTC GGCTTCCCGC
GCCGCCTGGT CGCCGGGGCT CTTCGCCGCG CCCAGGTCGT CGCCGCCGCC CCGGGTCGCG
ACCGCGTCCG GGGTGTCGGC GGTCCCCGCC GCGCCGGTCG CCCCCGTCCG GCTGCCGGCG
ACGCTTGACC TGCGCTACCT GCCCCGGGCC GCGTGGGGCG CGGACGAGTC GCTACGGCTG
TCCCCCTCGT CCGGCAGCGG TTGGAAACCG ACGTACCACC CGGGCCAGGT GGTCACGGTT
CATCACACGG TGACGCCCAA CGACGATCCG AACCCCGCCG CCACCGTGCG GGCGATCTAC
CACTTCCACA CGGTCGAGCG GGGATGGTCG GACATCGGGT ACCACCTCCT CATCGACGAG
GCGGGCACGC TCTACGAGGG CCGGTGGTCG GGAACGGACA GCGTTCCCGG CCACCGGGAG
GACGGCTACG TGGTGACGGG CGCGCATGTG GCGGACTTCA ACGCCGGCAA CGTCGGCGTC
GCGCTGCTCG GTGACCTGCG GACGCGCATC CCCACCGCCG CCGCCCGTCG CACGCTCGTC
CTGGTGCTGC TCGCGTTGAC CGGAGCGCAT CATCTCGACC CGCTCGGCAC CGTCCACTAC
GTCAATCCGG TGAGTGGCAG GCGTCGTACC GTCCCGGCCG TCAGCGGGCA CCGCGACTGG
ATGGCCACCG AATGTCCGGG CGGCACGGCC TACACCGCCC TGGCAGGTGT GCGGACGGAC
GTCGCCCGGC AGCTGATGTG A
 
Protein sequence
MSHAADPFRR RRREPGLSRR GILGLAGAGL GGGAEALLAA APSSASAGPA GPADAGARPG 
AEARALSAPG TRVVLVPRTL AAVTRGAGPA TSDFDIGYVA VRWTRDGNDR AADGRAGSGG
AGTGTAGTAD AGGAVSRGAE IRLRQPGGGF DAWRPLAVGC PAERDDSPAG GRSAAVLVAA
HRATGYELWL PPGATSVTST ALNTTSGPLQ RLTAPIVSAY AALSAISASR AALSAPSASR
AAWSPGLFAA PRSSPPPRVA TASGVSAVPA APVAPVRLPA TLDLRYLPRA AWGADESLRL
SPSSGSGWKP TYHPGQVVTV HHTVTPNDDP NPAATVRAIY HFHTVERGWS DIGYHLLIDE
AGTLYEGRWS GTDSVPGHRE DGYVVTGAHV ADFNAGNVGV ALLGDLRTRI PTAAARRTLV
LVLLALTGAH HLDPLGTVHY VNPVSGRRRT VPAVSGHRDW MATECPGGTA YTALAGVRTD
VARQLM