Gene Franean1_0635 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0635 
Symbol 
ID5669052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp737094 
End bp738779 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content78% 
IMG OID641239562 
Producthypothetical protein 
Protein accessionYP_001505000 
Protein GI158312492 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000516706 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.674007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAGC CGACCCCCCG GACGCTGCCG GGGTTCACAC CCGCCGCATG GGGACTGCTC 
GCCGCGACGG CCGCCTGCGC CCTCGGGGCG GTGCTCCTGC GCTACGCGGA GCTCGCCGCC
TTCGCCGGTG CCGGCGCGGC GGCGCTGTTC ACCGCGGTGG CAGCGGTGGC CCGTCCGCCG
CGGGTCACCG TCACGACACG GGTCACCCCT GCCGCGGTCA CCCGCGGGGA CGACGCCGCC
CTGGTCATCT CGATCGTGAA CCACTCCCGA TGGACGTCTC CCTCGTTCGC CCTACGCCTG
CCAGCCGAGC CGGCCCAGCC GGGCGAGGTG CCCGCCGAGC CCGCCGAGCC CGCCGAGCCC
GGCGGGTCTG TCGGGTCCGA CGGGCCCGAG ATCGCCGTGG ACATCCGCCC GTTGCGCGGT
GGCGCCAGCC GGGAGATCGT CCTGCCGCTC GACACCGCGG CTCGGGGAGT TCGCCGGATC
GGGCCCCCGC AGGTGCACCG GTCTGATCCG TTCGGGCTGG CCCACCGCCA CCAGTACCTC
GGCACGAGCC TCACCCTGCG GGTCCGCCCC CGCGCATACC CGCTGGTCCC ACCGCCGGCC
GCCCCGGCCC GTGACCCCGA CGGGCAGAGC GGACGTGGCG CGTCCGGGGG GCTGATGTTC
CACACCCTGC GCGAGTACAC CCCCGGGGAG GACCTGCGCC TGGTGCACTG GGCCGCCAGC
GCGCGCACGG GCACGCTGAT GGTCCGAACG CACCTGGACC CCAGTGAGCC CGCCTCCACC
GTCGTGCTGG ACACCCGCCG GCGGGCCTAC CCGCCCGGCC CGGTCGGGGC GGCCGTCTTC
GAGGACGCCG TCGACGTCGC CGCGTCCGCG GTGCTCGCCT GCGCCCGCAA CTCCTACGGC
GTCCGCCTGG TCACCTCGGG CGGGGTGCGG ATGACCGGTC GCCGACGCTC CACCGACGCC
GAGTCCCTCC TCGACGAGCT GGCCGACGTC CGGCCGGACG AGGGCGTGAC CCTGGATGTC
CTGCGTACCC TGCGCCGCGG GCCCGTCGGC ACCCTCGTCC TGGTGACCGG CGCGCTCGAC
CGGGACGCGG CGGCGGCGCT CGCCCCGGTG GCCCACGTGT TCGGCCAGGT GATCGTGCTT
CGGATGGGCC CGCGCAGCGA GGCCGCCGCG CTGGCCCGGG GCCGGCGCGC CCTCGGCGAC
CGCCCACGCC TGCGCCCGAG TGCGGAGGCG GTGGCACGGG CCCGCGCGGA ACGAGGCGGC
CCGGTCGCCC CCGCCGGGCC CACCCGCTCG ACGGGACCGG CGGGCGCGGC CCGTACGGCA
GGTGCGACCC GCATGGCCGG TGTGGCCGGT GCCGCTGGTG TGGCGGGGTC GGTTGGCATG
GTCGGCACCG CCGGTTCGGG CCGGGTGCGG ATGATCCACC TGGGCTCACC CGCCGACCTG
ACCGACGTCT GGCCTGCCGC GCCCCTCCCG CCCCGGGCGC CGGCCGCGGA CCAGGCGGCC
GCGAGTCCAG CGGCAGTGGG CTCGGCGGGA TCGGGTTCGG CGGGATCGGG TTCAGGGGGA
TGGGGTCAGG CCGAAACCGA GCCGGACCTG GCGGGACCGC TGCTACCGGT GGCCTCCCTG
GCGTCGGCCG GCCCCGGCCC CGGCCCCAGG CCCAGGACTG GGCGCGGCGC GGGCGGCGGA
TCGTGA
 
Protein sequence
MSEPTPRTLP GFTPAAWGLL AATAACALGA VLLRYAELAA FAGAGAAALF TAVAAVARPP 
RVTVTTRVTP AAVTRGDDAA LVISIVNHSR WTSPSFALRL PAEPAQPGEV PAEPAEPAEP
GGSVGSDGPE IAVDIRPLRG GASREIVLPL DTAARGVRRI GPPQVHRSDP FGLAHRHQYL
GTSLTLRVRP RAYPLVPPPA APARDPDGQS GRGASGGLMF HTLREYTPGE DLRLVHWAAS
ARTGTLMVRT HLDPSEPAST VVLDTRRRAY PPGPVGAAVF EDAVDVAASA VLACARNSYG
VRLVTSGGVR MTGRRRSTDA ESLLDELADV RPDEGVTLDV LRTLRRGPVG TLVLVTGALD
RDAAAALAPV AHVFGQVIVL RMGPRSEAAA LARGRRALGD RPRLRPSAEA VARARAERGG
PVAPAGPTRS TGPAGAARTA GATRMAGVAG AAGVAGSVGM VGTAGSGRVR MIHLGSPADL
TDVWPAAPLP PRAPAADQAA ASPAAVGSAG SGSAGSGSGG WGQAETEPDL AGPLLPVASL
ASAGPGPGPR PRTGRGAGGG S