Gene Francci3_4543 details

Gene Information

Locus tag: Francci3_4543
Symbol:
ID: 3907520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5427752
End bp: 5429626
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 71%
IMG OID: 637881876
Product: hypothetical protein
Protein accession: YP_483618
Protein GI: 86743218
COG category:
COG ID:
TIGRFAM ID:
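
The coordinates and lengths listed above are mutually consistent, which can be checked with a short Python sketch. It assumes that the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_len, protein_len = cds_lengths(5427752, 5429626)
print(gene_len, protein_len)  # 1875 624
```

Both values match the Gene Length and Protein Length fields of the record.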


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGGTC GGCGCCGGCC GACGCGTTCC CTCGTGCTGG CCGCCGCCGT GGGTGTCGGC 
TTGGCGGCCG TGCTCGGCGT TCTGCATGCC TATCTGACCA GACCCTTCTG GTACGACGAG
ATCTGGCGCG CCCACTTCGT CAGCGAACCG GCGTCCTCGT TCTGGCCGGA GCTGACCGTC
GCCAACACCC CCTCGGCGCT GGGTTGGTTC GGCCTGACCC GGGTGGCCGG CGAGCTCCTC
GGCTGGCACA GCTGGTCGCT CCGGCTTCCC GGGTATCTTG CCGTGCCCTT GCTCGGCGCC
GCGATCGTCC TGCTCACCCG CCGTTTCACC GGCCTGGTCG CGGCGGTGTT CGCCACGTGC
TGGCTGTGCC TGAACGTGAC CTTCTTGGAT CTCGCGACCC AACTCAAGCC GTATTCCGTC
GAGGCGCTGG CCGCTGTCTG CATCGTCCTC GCCTGGTCGA CCGGATCATC TCCGCGGCCC
TCGCTGTCCC CGTGGCCTGC ACCTGTCACC GGCGGGTTCT CCCGTGCCCG GCTCCTGCGA
CGCACCGCGG CGGGCCTGTT GGCCCTGACC GCCGTGCCCG GGATCTTTCT GGTGCTGCCG
TTGGCGACCG CGGAGGTATG GCGCGGTCCG GCGCGGCGCT GGCGACTGGT GGAATGCCTG
CCCGCGGTCG CCATCAGCGT GGCACACACC GTGTTCTTCA TCGGGCACCA GTCCAGTCAG
CGTCGCGGCG ACTACTGGGA TCGCCAGTTC ATTGCCGGTC GTGGGCCGGT CGAGGCCGTA
CGCTTCATCG TCGATCAGAT GCGAGCCTTC ATTACCGGTA TGCCGCCCGG CATCGACCGG
TTCGACCCGA GCCTCATCCA TGCGACGGTG CCCGGGGGAC CGGTTGTCTC GGTGCTGCTC
GCCGTGGCGG TGTCGATCGC CGGCCTGACC GGCGTCCTCG TGCTGGCGCG ACGACCGGAC
GGCAGGCTGC TCCTCACCGC GCTCGGCGGA GCCCAGGTGC TGATGCTGGG AGCGAGCGCC
GCGCGGTACT GGCCGTTCGG GCCGACCCGG ACCAACCTCT TCGTCGTGCC ACTGATCGTC
GTTGTGATCG TGGTGGGCAC CCGGCGGCTG GTGGCCCTGT TCGTGGCGGC GGTGCGACCT
CGTATCCCCG CCGCCGGGCC TCAGCCGGCC GTACCGGCAC CGATGGCCGC GACCCCGACC
CTCCCTACCG GAGGAACCGT AACCAAGCGG TCACGATTGG ATGTTTCCGG TGTCTCGCCG
ACGTCGAGCG CCAGCGCGAA CACGATTTCT CCGACCGCCG TGCTGCCGTT GGCCGGCCTC
GCAATGATCA TCTTGCTTGG TGTCCCGCTC CTGGGAACCT CGCTGTCGGG AAGCGGGCAG
CTCTGGGATC ATCGGCACCG GACTCGTGGC CTCGATCGGA TGGTCGACGC GGCCATCGCG
GCCCGGCAGC TGCACCGCCC GGGAGACCTG GTGGTGGTCG GGGGTCGCCT CGCCCGGCCT
GGCTGGCTGT ACGCGATGGA GGCGAGTGAC GACCCGGCGC GTGAGCCGGC AGCTCTCCCC
AGCGCCGGCG CCGGGCGGGT ACCGGCCCGG GTACCACGGC AGGACACGAT CTTCTTCGGC
GCTGCGGGTA CCTCCCTGGT CGGCGAGCTG GCGAACCGCC GGGTCCCGCC GACCCGGCTG
CTCGTCTTCC TCTTCGACGC GGAGCGGCCG GCGCTCGCGC CCGACCTGGC CGCGCTGCCC
CGGGCGGGGT GGTGCGCCGG CCGTGGCTGG CAGTTCCCCC TCACCGGAAC GCTGACCGTC
TTCGATCGCT GCGCCGGCGC CGCCCCGCGA CTCGATCGGC CGCGGCGGAC CGTGGTGTCC
TACCGGACGA CCTGA
 
Protein sequence
MAGRRRPTRS LVLAAAVGVG LAAVLGVLHA YLTRPFWYDE IWRAHFVSEP ASSFWPELTV 
ANTPSALGWF GLTRVAGELL GWHSWSLRLP GYLAVPLLGA AIVLLTRRFT GLVAAVFATC
WLCLNVTFLD LATQLKPYSV EALAAVCIVL AWSTGSSPRP SLSPWPAPVT GGFSRARLLR
RTAAGLLALT AVPGIFLVLP LATAEVWRGP ARRWRLVECL PAVAISVAHT VFFIGHQSSQ
RRGDYWDRQF IAGRGPVEAV RFIVDQMRAF ITGMPPGIDR FDPSLIHATV PGGPVVSVLL
AVAVSIAGLT GVLVLARRPD GRLLLTALGG AQVLMLGASA ARYWPFGPTR TNLFVVPLIV
VVIVVGTRRL VALFVAAVRP RIPAAGPQPA VPAPMAATPT LPTGGTVTKR SRLDVSGVSP
TSSASANTIS PTAVLPLAGL AMIILLGVPL LGTSLSGSGQ LWDHRHRTRG LDRMVDAAIA
ARQLHRPGDL VVVGGRLARP GWLYAMEASD DPAREPAALP SAGAGRVPAR VPRQDTIFFG
AAGTSLVGEL ANRRVPPTRL LVFLFDAERP ALAPDLAALP RAGWCAGRGW QFPLTGTLTV
FDRCAGAAPR LDRPRRTVVS YRTT
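
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal Python sketch of that relationship, not the pipeline used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and it assumes the sequence is given in the coding orientation. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, in the 10-base blocks used above.
print(translate("ATGGCGGGTC GGCGCCGGCC GACGCGTTCC"))  # MAGRRRPTRS
```

The output matches the first ten residues of the protein sequence, and the last 15 bases of the gene (`TACCGGACGA CCTGA`) translate to `YRTT` followed by the TGA stop codon, which matches its final four residues.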