Gene Franean1_0380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0380 
Symbol 
ID5668804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp455358 
End bp457001 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content71% 
IMG OID641239312 
Producthypothetical protein 
Protein accessionYP_001504752 
Protein GI158312244 
COG category[S] Function unknown 
COG ID[COG5298] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.396423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.044592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGCCCG CCGAGGGGCG CTTCGCCTCA CCGCCGGCTC CACCGGCGCC GCCGCGCACC 
CCGCCGCCAC GGCGGGCCGC GCGGCCCGTC CCGGGCACGA ACCAGCGCCC GCCCGCGGCA
CCGGACGGCG CCACCGGGCC CGGGGCCGCG TCGGCGCCGT CCGGCCAGGC GACCGGGACG
GGCACCCTGA TCCTGTACGA CACCACCGGC GCGTGGGGGT GGCTGGGCGA GCAGTACGCC
ATGCAGGCGG CCAACCTGGC CTCCCGGTTC GGCACCTGGC AGGCCCGTCC GGTCAGTTCG
TACACCGCGG GCCAGATGTC CGCGTACGCC GCGGTGGTGT ACGTCGGGTC GACCTACGAC
GAGCAGGTCC CGACGGCGTT CCTCACCGAC GTGCTGGCGG GGAACCGGCC CGTGGTGTGG
ATGTACAACA ACATCTGGCA GCTCACGTCG CAGGCGCCGA CCTTCCCGAC GACGTACGGG
TGGAACTGGT CCGGCTTCGA CACGTCGGCG ATCGGGACCG TCAGCTACAA AGGGACCGAT
CTGACCCGGT ACACGGCGAA CGCCGCCGGG ATCATGAACT ACGCCTCCGT GGACACCACC
AGGGCCACGG TGCTCGCCGA GGCGGTGCGC GGTGACGGCA CCCGCTTCCC CTGGGCGCTG
CGGTCCGGGA ACCTGACCTA CATCGGCGAG ATCCCGTTCG CCTACGCCGA CATGACCGAC
CGCTACCTCG CGGTGGCCGA CATGCTCTTC GACGTCCTCG CGCCGCAGAC CGCCGCCCGG
CACCGCGGGC TCGTCCGCAT CGAGGACGTC GGGCCGGACG CCGACCCCGC GGAGCTGCAC
GCGATCGCCG ACTACCTGTC CTCTGCGCAG GTACCGTTCT CGGTCGCCGT CTACCCGCGG
TACGTGGACG CGAACGGCAC GTACAACAAC GGCGTACCGC AGGACTACAC TCTCGCGTCC
AAACCGGCGG TGGTCAGCGC GCTGAAGTAC ATGACCCAGC GCGGCGGCAC ACTGATCATG
CACGGCTGGA CGCACCAGTT CTCGAACGTC GCCAACCCGT ACTCGGGCGC GAGCGCGGAC
GACTTCGAGT TCTTCCGGGC GCATGTCGAC GCCCAGGACT ACGTGGTCTA CGACGGACCC
GTGCCCGGCG ACAGCCAGGC GTGGGCGACC GACCGGATGA ACGGCTCGGC CGCCGCGTTC
ACCGCCGCCG GGCTGCCGGT GCCGACGACC TTCGAGTTCC CGCACTACGC CGCGTCCGCC
CCCGACTACG CCGCGGCCCG GGCGAAGTTC CCGCGCCGCT ACGACCGCGG GCTCTACTTC
CGCAACCAGC TCGCCGGCGG CGCGGTGGAC CACACGAAGT ACGGCGGCCA GTTCTTCCCG
TACCCGGTGA CGGACGTCCA CGGGTCCTTC GTCATTCCAG AGAACATCGG GAACATCGAG
CCCGAGCCGT TCAACAACCA CCTGGCGCGG CTGCCCGCGG AGCTGATCGA CGCGGCCCGG
CGCAATCTGG TCGTCCGGGA CGGTTTCGCC AGCATGTTCT ACCACCCGTA TCTGGGCGTT
GACTACCTGC GCCAGACGGT GGAGGGCGTG CGCGCCCTCG GATACACCTT CGTCGCGGCC
GGCTCCGTCG TCGCGGGCGG GTAG
 
Protein sequence
MLPAEGRFAS PPAPPAPPRT PPPRRAARPV PGTNQRPPAA PDGATGPGAA SAPSGQATGT 
GTLILYDTTG AWGWLGEQYA MQAANLASRF GTWQARPVSS YTAGQMSAYA AVVYVGSTYD
EQVPTAFLTD VLAGNRPVVW MYNNIWQLTS QAPTFPTTYG WNWSGFDTSA IGTVSYKGTD
LTRYTANAAG IMNYASVDTT RATVLAEAVR GDGTRFPWAL RSGNLTYIGE IPFAYADMTD
RYLAVADMLF DVLAPQTAAR HRGLVRIEDV GPDADPAELH AIADYLSSAQ VPFSVAVYPR
YVDANGTYNN GVPQDYTLAS KPAVVSALKY MTQRGGTLIM HGWTHQFSNV ANPYSGASAD
DFEFFRAHVD AQDYVVYDGP VPGDSQAWAT DRMNGSAAAF TAAGLPVPTT FEFPHYAASA
PDYAAARAKF PRRYDRGLYF RNQLAGGAVD HTKYGGQFFP YPVTDVHGSF VIPENIGNIE
PEPFNNHLAR LPAELIDAAR RNLVVRDGFA SMFYHPYLGV DYLRQTVEGV RALGYTFVAA
GSVVAGG