Gene Francci3_2918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2918 
Symbol 
ID3903982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3438337 
End bp3439941 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content71% 
IMG OID637880239 
Productvon Willebrand factor, type A 
Protein accessionYP_482005 
Protein GI86741605 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.306616 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCGAC GGCCGCCCGC GGCCCTGCTC GCGTGTCTCC TGACGGTGCT GCTCGTGGCC 
GCCTGCTCGG GCGGAGGCCC GGGACCGGCG GACGGGTCCG GCACCGGCGG GTCCGCAGGC
GAACTCACCG TCCTCGCCGG CTCCGAACTC CAGGACATCC AGCCGCTGCT TCCCGATCTC
ACCCGGGCGA CCGGCGTCAC CCTGCGGCTG AGCTACACCG GCACCCTGGA CGGCGCCGAC
GCGATTACCA GGGGCACCGC CCAGGCCGAC GCCGCCTGGT TCGCCTCCGA CCGGTACCTG
CGACTGCTGC CGAACGGCGC GACGTCCGTA GCCGCGCGAT CGTCGATCAT GCTGTCACCG
GTGGTGTTCG GGGTCCGACG CAGCCTGGCC CGGCAGTTCG GCTGGACCGG CAACCCCAAC
GTGACCTGGG CCGACATCGC CGCGAAGGTG GCGGCCGGCC AGCTCTCCTA CGCCATGACG
AATCCCGCGG CGTCGAACTC GGGTTTCTCC GCCCTGGTCG GGGTGGCGGC GGCACTCGCC
GGCACCTCGG ACGCGCTGCG CCCGCAGGAC ATCAGACCTG CCCAGCTGAC CTCGTTCTTC
TCTGGCCAGG CACTGACCGC GGGCAGTTCC GGTTTCCTGA CCGACGCCTA CGCCCGTTCC
CAGGACACTC TCGGCGGCAT GATCAACTAT GAGTCGGTGC TGCTGGCCCT GAACGCCGGG
CACCGGCTGC GCGAACCATT GGAGTTGATC TATCCCCGGG ACGGGATCGT CACCTCCGAC
TACCCTCTCC TGCTGCTGCG GCCGGACAAG CGTGATCTCT ATGACCGGGT CGTCAGCTGG
CTGCGCCTGC CGCGGACGCA GCACCGGCTG CAGACTGCCA CGAGCCGCCG GCCGGCCCTG
CCCGGGGTGG CGTTGGACGC CCGGTTCCCG ACGCGGACGC TCACCGAGCT GCCCTTCCCG
GCGAGCCGGC AGGTCGCCGA CCAGCTGCTG TCCGCCTACC TCGACCGGTT CCGGCGGCCC
AGCCACGCCA TCTTCCTGCT CGACGTGTCC GGGTCGATGG CCGGATCACG GATAGCGGCG
CTCCAGGCGG CGCTGCGCGG ACTCACCGGT GCGGACGACA CGTTGTCCGG CAGGTTCGCC
CGGTTCCGGG GTCGCGAGAA GATCACGATG ATCACCTTCG CCGGCCGGGC CAACGACCCC
GTGGACTTCG CCGTGAACGA CCCGCGGCCG GGCTCCGCGG ATCTGGCGGG CGTCAACACG
TTCGTGGACG GCCTGCGGCT CCAGGACGGC ACCGCGATCT ACTCGGCGCT GGAGGCCGGC
TACCGGGCTG CCGGCGCGGC CGTTGAGGCC GACCCCGGCT ACCTCACGTC GATCGTCCTG
ATGACCGACG GGGAGAACAA CTCCGGTATC TCGGCCGCCG ACTTCCGCTC CTCCTATCAG
CGGCTGCCGG CGGCCGCGCG TGCCGTGCGT ACCTTCACGA TCGCCTTCGG GGAGGCCGAC
CCGGCGGCGC TGCGGGATAT CTCCGCCGAC ACCGGTGGCG CGGTGTTCGA CGCCCGTACC
TCGTCGCTGG CGGACGCGTT CAAGGACATC CGTGGCTACC AGTGA
 
Protein sequence
MNRRPPAALL ACLLTVLLVA ACSGGGPGPA DGSGTGGSAG ELTVLAGSEL QDIQPLLPDL 
TRATGVTLRL SYTGTLDGAD AITRGTAQAD AAWFASDRYL RLLPNGATSV AARSSIMLSP
VVFGVRRSLA RQFGWTGNPN VTWADIAAKV AAGQLSYAMT NPAASNSGFS ALVGVAAALA
GTSDALRPQD IRPAQLTSFF SGQALTAGSS GFLTDAYARS QDTLGGMINY ESVLLALNAG
HRLREPLELI YPRDGIVTSD YPLLLLRPDK RDLYDRVVSW LRLPRTQHRL QTATSRRPAL
PGVALDARFP TRTLTELPFP ASRQVADQLL SAYLDRFRRP SHAIFLLDVS GSMAGSRIAA
LQAALRGLTG ADDTLSGRFA RFRGREKITM ITFAGRANDP VDFAVNDPRP GSADLAGVNT
FVDGLRLQDG TAIYSALEAG YRAAGAAVEA DPGYLTSIVL MTDGENNSGI SAADFRSSYQ
RLPAAARAVR TFTIAFGEAD PAALRDISAD TGGAVFDART SSLADAFKDI RGYQ