Gene Francci3_3080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3080 
Symbol 
ID3904282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3649750 
End bp3651105 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content72% 
IMG OID637880401 
Producthypothetical protein 
Protein accessionYP_482166 
Protein GI86741766 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.734397 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGACA GTGAGGCGGT TGCGGGTAGC CAGCTGACGG ACTGGATCTC TCTGGGGGTC 
CTGACGTCGT TCGTCCCGCG TGATGCGGTC GATGAGGCGA TCGAGGCGAC CGGGGCGGGT
GCCCGGCGCT CCGACACGAC GATCCCCCCG CAGGTCGTGG CCTATTTCGT GATGGCGCTC
GCGTTGTTCG CGGACGACGA CTACGAGACG GTCGCCCGCC GGCTCGCCGC CACGCTCACC
GATCTCGACG TGGTGGGGCC GCGGTGGGAA CCGACCTCGT CAGGGCTGAC CAAGGCCCGC
CAGCGGCTCG GTGCGGCGCC GCTGGCCGAA CTGTTCGGTC AGGTCGCCGG GCCGGTCGCG
GACCTGGACA CGGTCGGGGC GTTCCTGAGC CGGTGGCGGC TGATGAGCAT CGACGGGCTG
GAATGGGATG CCCCCGCCTC GAAGGAGAAC ATCGCCGCGT TCGGCCTACC GGCCGGCCGC
GTCGACGCGC CAGGGGTACT GCCGAAGGTC CGCGCGGTGA CCGTGTCCGA GTGCGCCTCG
CACGCGCCGG TCCTGGCCGC GTTCGGCCCG GCCGGTGGGG CGAAACCCGC CAGCGAGCAG
GCACTGGCCC GGACTGTCTA CCCGCGGCTG GCCTCGGACT GGCTGCTGCT CGCGGACCGT
AACTTCTACT CGTGGGCGGA CTGGTGCACC GCGGCGGACA CCGGTGCGGC GCTGCTGTGG
CGGGTCAAGG CCACCCTGCG CCTGCCCCCG CTGCGCGCGT TGTCCGACGG TTCCTATCTG
ACCGTGCTGG TCAACCCGAA GGTCACCGGG AAGGCCCGGG AGACCCTCGT CACCGCGGCC
CGCGCCGGCG CGCCGCTGGA CCCGACGAAA GCCCGTTACA CCCGCCTCGT CGAGTACGAC
GTCCCCGACC GTGAGGGCGA CGGGAAACAC GAGATCACCG GCCTGCTCAC CACGATCTGT
GACCCGCGGG AGGCGACCGC GACCGCTCTG GCCGGGGCCT ACCGGCAAAG ATGGGAACAC
GAGGTCGCGA TCGAAGACGC CAAACAACTC GTCGGCGTCG GCCAGGCCCG CAACCGGCTC
GCCACCGCCG CGCAACGCGC CGTCCCGTTC GGCCTGACCT GCCAGACCCT CGCCTTCACC
TGGTACCTCA CCACCGGCCA CCACCACGAC GACGCCGCGG AGCACCGCGC CCGCGCGCCC
TGGTACACCA CCAAGACCCG GCCCTCGACC GCCGACCTGC TCGCCAAGCA CCGCCGCGTC
CTCATCGCCA CCAAATACCA GCCCGCTCAC CCCGAACAGC CCACCCCAGC CGAAATCCAC
ACCCTCCGAC TGGCCTGGGA GATCACCGCC GCATAA
 
Protein sequence
MADSEAVAGS QLTDWISLGV LTSFVPRDAV DEAIEATGAG ARRSDTTIPP QVVAYFVMAL 
ALFADDDYET VARRLAATLT DLDVVGPRWE PTSSGLTKAR QRLGAAPLAE LFGQVAGPVA
DLDTVGAFLS RWRLMSIDGL EWDAPASKEN IAAFGLPAGR VDAPGVLPKV RAVTVSECAS
HAPVLAAFGP AGGAKPASEQ ALARTVYPRL ASDWLLLADR NFYSWADWCT AADTGAALLW
RVKATLRLPP LRALSDGSYL TVLVNPKVTG KARETLVTAA RAGAPLDPTK ARYTRLVEYD
VPDREGDGKH EITGLLTTIC DPREATATAL AGAYRQRWEH EVAIEDAKQL VGVGQARNRL
ATAAQRAVPF GLTCQTLAFT WYLTTGHHHD DAAEHRARAP WYTTKTRPST ADLLAKHRRV
LIATKYQPAH PEQPTPAEIH TLRLAWEITA A