Gene Francci3_4525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4525 
Symbol 
ID3907502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5398829 
End bp5400718 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content70% 
IMG OID637881858 
Producthypothetical protein 
Protein accessionYP_483600 
Protein GI86743200 
COG category[S] Function unknown 
COG ID[COG5650] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.48378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGCA TGGCCATGTC CCCGGCGTCC CCCGTCCGCG TCAGCCCGGC CCGGGAGGAC 
CCGGTCGTCG CGGGGGCAAG CCAGCTCATC GGTGGGCCGC CGGGTGAGCA CGCGGCCGTG
CCCGACCGGC GTGGTTGGCT GACCCCGCTG CGGGTGCTGA TGGCAATGGT CATCGTGGCC
GGGGTCCTCG GCTACACCCA GAAGATCAGT TGCCGGGACA CCCGGAACTG GACGCATGAG
CACCAGTACA CGGCGTTGTG CTACTCCGAC ATCGTCGCGT TGTACAGCCA GGAGGGGCTG
GTCGACGGGA AGATCCCCTA TCTGGACTAC CCGACGGAAT ATCCGCCGTT GATCGGCGCC
ACCATGCAGC TCGTCGCCTG GCCGGCCGGT CTCGCGAACG GACCGCGGGC GGTCTACCGG
GAGGAGAACG GCGAGCGCGT CTTCGACCAT TACACGATGG ACATGCGGTC GGCCGTCTTC
TACGACCTCA CCGCCCTGCT GTTCCTGATC GTGGCCTCGG TCGCGGTGTG CTGCACCGCG
CTGGCGGCCG GATCGAGGCG GATCTGGGAC GCCGCTCTGT TCGCGCTGGC GCCGACCCTC
GTGCTGCATC TGCTGACGAA CTGGGACATC ATCGCGGTGG CGTTCGCGGC GGGCGGCCTG
CTTGCCTGGT CCAGACGGGC TCCGAAGCTC GCCGGGATGC TAATCGGACT CGGCATCCTC
ACGAAGCTCT ATCCGGCCCT GTTCCTGCTA GGGTTGGCCT TTCTCTGCCT CCGGGCCGGG
AGGCTGCGCG AACTGGCCCG GACCGTCTGC GCGGCGGCGG TGACGGTGCT GGTGGTCATC
GTCCCGCTAT GGCTGACCGC CGGCTACTTC AACGCGGAGA ACGCCCGCGT CGGCGACAGC
ATCTGGTCGA CCTTCTGGCG AGGTGGTGAC TGGATCCGGC TGGTCGGTGG TGGTCCGGAC
GGGGCCCGCA ACGCCATCGC CCGGTTCTTC GATCTGAACT CCGACCGCGG CGCCGACTGG
GACTCGCTGG CGTTCGGCGC GACCTGGCTT GCCGGCCGCT ACGACCCGGC CTGGTTCGGA
GCACTGCACC TGGTGGTCTC CGCGCTCACT GCCGTCGTCC TCGTCGCGGT GGCCGCGGCC
GCGATCCATC ATCGGCCCGA TGCGCGCCGG CTGGTCATCG CGGCGGCCGC GGCCGGCTGG
GTGACGATCG TCGTCGCCCT CCCGATGATC CTGCAGGGCT TGCGGAACAA CGGCCTGCCC
ATCTCCACGT TGAACATCAT CACTGGGGCG GCCCTGCTGG TCTCGGTGGT CGCCATCGGG
TTACTCACCT GGCTCGCACC GCGCCGGCCC CGTCTGCCGC AGGTGCTGTT CCTGCTCGTC
GTCGCCTTCC TGCTGACCAA CAAGGTCTTC TCACCCCAGT ACACGATCTG GCTGCTACCG
CTGGCGGCAC TGGCTCGGCC CCGATGGAGG CTGTTCCTGC TGTGGCAGGT CTGCGAGGCG
TGGGTGCTGT TCACCCGATT CATGCACTTC ATCTACAACG ACACGCAGGG TCGTCACGGC
ATCGACCGGG GATGGTTCGT CGGCGCGGTG GCCCTGCGCG ACCTGGTGCT GCTCGTCCTG
GCCGGATTCG TCGTCCGGGA GATCCTGCAT CCGCATACCG ACATCGTGCG GACCGGCGGA
ATGGCGTACC CGGGTGATCC GCGTGTCGCG GTCGGCTCCG CTGCCGTCGG CGATCCGGAC
GCCGTTGACG ACCCTGCGGG CGGGGTGCTG GACCATGCCC CCGATGTCCG CGGCGGCCGG
CCGGCGTGGG TTACGGCGGC GGCCACCGGC GAGCCGGCCG GAGGCCCGGG CGCCGGCAGC
GCCGACGGGG CCGGGTCCGG CGGTGCATGA
 
Protein sequence
MTGMAMSPAS PVRVSPARED PVVAGASQLI GGPPGEHAAV PDRRGWLTPL RVLMAMVIVA 
GVLGYTQKIS CRDTRNWTHE HQYTALCYSD IVALYSQEGL VDGKIPYLDY PTEYPPLIGA
TMQLVAWPAG LANGPRAVYR EENGERVFDH YTMDMRSAVF YDLTALLFLI VASVAVCCTA
LAAGSRRIWD AALFALAPTL VLHLLTNWDI IAVAFAAGGL LAWSRRAPKL AGMLIGLGIL
TKLYPALFLL GLAFLCLRAG RLRELARTVC AAAVTVLVVI VPLWLTAGYF NAENARVGDS
IWSTFWRGGD WIRLVGGGPD GARNAIARFF DLNSDRGADW DSLAFGATWL AGRYDPAWFG
ALHLVVSALT AVVLVAVAAA AIHHRPDARR LVIAAAAAGW VTIVVALPMI LQGLRNNGLP
ISTLNIITGA ALLVSVVAIG LLTWLAPRRP RLPQVLFLLV VAFLLTNKVF SPQYTIWLLP
LAALARPRWR LFLLWQVCEA WVLFTRFMHF IYNDTQGRHG IDRGWFVGAV ALRDLVLLVL
AGFVVREILH PHTDIVRTGG MAYPGDPRVA VGSAAVGDPD AVDDPAGGVL DHAPDVRGGR
PAWVTAAATG EPAGGPGAGS ADGAGSGGA