Gene Francci3_0984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0984 
Symbol 
ID3905840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1161853 
End bp1163328 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content74% 
IMG OID637878318 
Producthypothetical protein 
Protein accessionYP_480097 
Protein GI86739697 
COG category[S] Function unknown 
COG ID[COG5650] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGAA CCCGCGCCGC GCCACGCCTC AGGCCCGCCC TGGCCCTGGT CGGGGCGGGT 
GTCGGGCTCG CGGGTCTGCT CGCGACGCAG TGCGCCGTCC TTGCGGCGCC GGGATACCTG
CCTGGCCGGG CAGGGGTGCT CTACCCCGTC ATGCTCTGGT GGGCGCTCGG TGTCGCCACG
GCCGCGCTGC TGGTACACGC GGCCCCGCGC CGGTTGGCCG TCGCCATGCT GCTCGGCGGA
ATCGTCGCCA TCCACGCCGT AGCGGCCACC ACCGGCCCGC AGCTCTCCGA CGACCTGTAC
CGCTACGCCT GGGACGGTCG GGTCCAGGCC GCCGGGATCG ACCCCTACCG GTACGGCCCG
CTCGCTCCGG AACTGGCCCG GCTGCGCGAT CGCTGGCTGT TCCCGGATCC GGCCGGATGC
GCGGCGATCG GGCGCGGCCC GCACTGCATC CGGCTGAACT ACCCGCGCGC CCACACCATC
TACCCGCCGG TGGCGCAGGC GTACTTCACC GCCGTTCATG TCCTGCCGGG GCCGCCACGC
GAACACAAAC TCCAGCTCTA CGCCTCGCTG ATGTCCCTGG CGCTGGTCGG GCTGATGATG
CGGATGCTGG TGGCGCGAGG CCGCGATCCG CGCCACGCGG CCTTCTATGC GTTCTCGCCG
CTCGCCGGCC TGGAGATCGG TTCCGATGCG CACGTCGACG TCCTCGGGGC CGTGCTGGCG
CTCAGCGCTC TGGCGGTCCT CACCGCCCGG TCCCGCCCGC TGCGGACGGG GGTCGCCGGG
GCGCTGCTCG GCGGCGCGGT AGCGGTGAAG CTGTACCCGG CGTTGCTGCT GCCCGCGGCG
GCCCGGCGCC GGCCCGTCAC GCTCGTCGGA GCCGCCGCCG GGGTGGTCGT GCTGTCCTAC
CTGCCGCACA TCCTCGCCGT CGGCACACAG GCGCTGGGCT TCCTCCCGCA GTACCTCGAC
GTCGAGGGCT ACGGGGAGGG CAGCCGCTTT CTCCTGCTCG CCGGGCTGCT GCACCTGGAC
GGATCCGCAG CGAAGGCGGC AGCGGCCACC CTGCTGGCGG CGGTCACGGT CGCCGTGCTG
CGGACCGATC CACAGCGGGT TCCCGTCGAA CGGGCCGCGC TGTGGCTGGT CGGTGCCGCG
TTCCTTGTCG CGACCCCGGT CCAGCCCTGG TACGGGGTGC TGCTGGCGGC GCTCGCCGTC
ATCGCCGGGC GGCTGGAATG GCTCGCCGTC GCCGCGGCCG CGCACCCCGT CTACGTCTCG
CTGTTCACGG ATCTGCCCGG TGACGCGTGG ACTTTGCGGG TGTATTCCTA CGCCGTCGGT
GGTGGCGTCG TGCTCGCCGC GACCGGTCTG CGCCGGTGGA CCGGCCGACG GCTGGCGGTC
GACGAGCGCG CTGCCGCACC CGCGGCCCGA TCCGTCGTCG AACCCCGGCC CGAGTCGTCA
CCGGGCGCAG GGACTGAAGG GCAGGTGAGG GTGTAA
 
Protein sequence
MPRTRAAPRL RPALALVGAG VGLAGLLATQ CAVLAAPGYL PGRAGVLYPV MLWWALGVAT 
AALLVHAAPR RLAVAMLLGG IVAIHAVAAT TGPQLSDDLY RYAWDGRVQA AGIDPYRYGP
LAPELARLRD RWLFPDPAGC AAIGRGPHCI RLNYPRAHTI YPPVAQAYFT AVHVLPGPPR
EHKLQLYASL MSLALVGLMM RMLVARGRDP RHAAFYAFSP LAGLEIGSDA HVDVLGAVLA
LSALAVLTAR SRPLRTGVAG ALLGGAVAVK LYPALLLPAA ARRRPVTLVG AAAGVVVLSY
LPHILAVGTQ ALGFLPQYLD VEGYGEGSRF LLLAGLLHLD GSAAKAAAAT LLAAVTVAVL
RTDPQRVPVE RAALWLVGAA FLVATPVQPW YGVLLAALAV IAGRLEWLAV AAAAHPVYVS
LFTDLPGDAW TLRVYSYAVG GGVVLAATGL RRWTGRRLAV DERAAAPAAR SVVEPRPESS
PGAGTEGQVR V