Gene Francci3_1030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1030 
Symbol 
ID3906272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1221773 
End bp1223491 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content72% 
IMG OID637878363 
Producthypothetical protein 
Protein accessionYP_480142 
Protein GI86739742 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.16283 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCTCC CTCCCCCTCC CGGCCCTGAC CCGGCCGGCC CCCACGACCC CACCAACCGT 
GACGATCGGC CCGGCTCTCG GGCGGCGCGG ATGCGGATGC CGCTCGCCCG TGCGGTCGTC
GAAGCGGTCG CCGTCGAGAA CGGCGTGTGC GTCCGGCCGA TGGCGATGCG GCGCACGAAC
CTCGACACCG GCGAAACCGA GATCATCCCC GTACCCTGCG GCGCCACCCT GGCCAGCAAG
TGCCCGACCT GCGCGGAGAA GGCCCGGCGG CTGCGGATGG CGCAGTGCAG GGCAGGGTGG
CACCTCGACG ACGAACCGCT TCCCGACCCG GACCCGCCGT CCGATGACGC GAAGGTCCTT
GCCGGCTTCC GCGCGGATCT CGAAGTCGCC CGGCAGGACG CGGAACGGGA CAGTGACCCG
GCCGGCGTCG CCGAGATCGA CGCGCTGATC GGCCAGGTGG ACGAGGAACT CAACGCCCTG
GGCGTGCGCG GCAAGACTGC CCCGGACAAC CGGGACCGGC CCCGCCGTGC CCGCTCGACC
CGCCGGCGGC AGGACGCTGC CGACCTGCCG CGGTTGCCCG TGGAGAAGCG CACGATCGGC
CGGACGTATG AGGCGGCGGA CGGCACGACC TGGCGGCCGT CGATGTTCCT CACGCTGACC
TGCGACACCT ACGGGCGGGT GAACTCCGAC GGAGCTCCGG TGGACCCAGC GTCGTATGAC
TACCGGCGGG CGGCCCGCGA CGCGATCCAC TTCCCGAAGC TGATCGACCG CTTCTGGCAG
AACCTCCGCC GTGCGGTCGG CTGGGACGTG CAGTACTTCG CCGCGCTCGA ACCGCAACGG
CGGCTCGCCC CGCACCTGCA CGCGGCCCTC CGCGGAACCG TGCCACGGGC CCTGCTGCGG
CAGGTGGCGG CGGCCACGTA TCACCAGGTC TGGTGGCCAC CGTCTGGCCA GCCCGTCTAC
CTGGACACGG CACTCCCGAC CTGGGCAAGC GAAACGGGCG GATACGTCGA TCCGGCTTCC
GGCCGGCCGC TGCCCACCTG GGATGAGGCG CTCGACGCCA TCGGAGACGA GGACGAACCG
TCCCATGTGG TGCGCTTCGG CCCGCAACTG CAAGCGGACG GCTTCACCTC GAACTCGGCG
CACACCGGCC GGATGATCGG CTACCTCTGC AAGTACCTGA CCAAAAGCCT CGACGCCTGC
CACACGGCCA CCACCGACCG GCAACGGCGC CACGTCGACC GGCTCGCCGA AGCCCTGCGC
TACGAACCCT GCTCACCCAC CTGCGCGAAC TGGCTCCGCT ACGGCGTCCA GCCGAAGAAC
GCGAAACCGG GCCTCGTCCC GGGACGCTGC CGCGGCAAGG CACACCGACG GGAGACCCTC
GGCTTCGGCG GCCGGCGGGT CCTGGTGAGC CGCAAGTGGT CCGGCAAGTC GCTGACCGAC
CACAAGCATG ATCGCGTCGC GTTCATCCGG GAGCAGCTCG AAGCGCTCGG CCACACCGCC
ACCGGCCCGG CCGCGGCAAC CGACACCGAC CCGGCTCGCA CCGCCTGGAC GATGCTCCGG
CCCGGCGACC CGGCCGCACC TCGCCGCGAA CACCTGCTGT TGCAGGCCGT CGCGCAACGC
CACGCCTGGC GCGCACAGCT CGACGCGGCC CGACGCGCCG CACCCGACGA ACTTCCGGCA
ATCGGCCTCG GCCCACCGGG CAGGGCACAA GCCGCCTGA
 
Protein sequence
MTLPPPPGPD PAGPHDPTNR DDRPGSRAAR MRMPLARAVV EAVAVENGVC VRPMAMRRTN 
LDTGETEIIP VPCGATLASK CPTCAEKARR LRMAQCRAGW HLDDEPLPDP DPPSDDAKVL
AGFRADLEVA RQDAERDSDP AGVAEIDALI GQVDEELNAL GVRGKTAPDN RDRPRRARST
RRRQDAADLP RLPVEKRTIG RTYEAADGTT WRPSMFLTLT CDTYGRVNSD GAPVDPASYD
YRRAARDAIH FPKLIDRFWQ NLRRAVGWDV QYFAALEPQR RLAPHLHAAL RGTVPRALLR
QVAAATYHQV WWPPSGQPVY LDTALPTWAS ETGGYVDPAS GRPLPTWDEA LDAIGDEDEP
SHVVRFGPQL QADGFTSNSA HTGRMIGYLC KYLTKSLDAC HTATTDRQRR HVDRLAEALR
YEPCSPTCAN WLRYGVQPKN AKPGLVPGRC RGKAHRRETL GFGGRRVLVS RKWSGKSLTD
HKHDRVAFIR EQLEALGHTA TGPAAATDTD PARTAWTMLR PGDPAAPRRE HLLLQAVAQR
HAWRAQLDAA RRAAPDELPA IGLGPPGRAQ AA