Gene Francci3_1692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1692 
Symbol 
ID3903269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2029410 
End bp2030720 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content76% 
IMG OID637879030 
Producthypothetical protein 
Protein accessionYP_480797 
Protein GI86740397 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000294566 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.174078 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCATC CCGTACGAAT TCCCGCCGAT GTTGATCGCG AAGATCGGAT CATGGCGGGC 
CTGACCGCCC GGCAGGTGTT GATCCTCGCG CTGACCGCGA TCGTGCTCTA CCTCGCCTGG
GCCGCGACCC GTGCCCTGCT GCCGTTGCCG GTGTTCGCGC TGCTCGCCGT CCCGGTCGCC
GCGGGCGCCG GCGTCCTCGT CCTGGGCCAG CGCGACGGGC TGTCCCTCGA CCGGATGCTC
GTCGCCGCGA TCCGCCAACG CACCAGCCCA CGACACCGCA TCAACGCCCC CGAAGGAGTG
ATTCCGCCGC CGTCCTGGCT GGCCGCCCGC GCCACGAGCA GCTCCGGTGA CCGACGGCCG
GCCGCGGGCG GGCAGAGCGC GGCGCCGCTG CGGCTACCGG CCCGCACCGT CACCACCAAC
GCCGGGGTCG GCGTGATCGA CCTCGGGCCG GACGGGCTTG CGGTCGTCGC GGTCGCGAGC
ACGGTGAACT TCGCGCTGCG CACGCCGGGC GAGCAGGACG GGCTGGTCGC CGTGTTCGCC
CGCTACCTGC ACTCCCTGAC CGCGCCGGTG CAGATCCTCG TGCGGGCCAT GCCCGCCGAC
CTGACCGACC AGATCCGTCA ACTCGACGAC GCCGCCGACC AGCTGCCCCA CCCCGCGCTC
GCGCACGCCG CCCGCGAACA CGCCACCTAC CTGGCCCAGC TCGCCGACGA GATGCAGCTG
CTGACCCGCC AGGTCCTGCT GGTCCTGCGA GAGCCGCTCG TGGCGGCCGG CCCGGTCGAC
GGGCTCGGGG GCGCATCCCC GCTGGCCGCA CTGTCCGGCC GACGGGCGGC GGCCCGCGAC
GCCCGCCGCG CCGGAGCGGC CATCCGACGG GCCGCGCACA CCCGGCTCGC CCGCCGGCTC
GCCGAGGCGA CCGACCTGCT GGCACCGGCC GGGATCGTGG TCACGCCGCT GGACGCGGGC
ACGGCGACCA GCGTGCTGGC CGCTGCCTGC AACCCGGCCG GCCTGGTGCC GCCGGCCGCG
CTCGCGGCCC CCGACGACGT CATCACCGCC GATGTCCCCG AGCCCGTCGA CAGCTACTCG
GCCTACCAGC CCGACACCGA CGACGGCTTC CTGGACGACG CCGGGTTCGA CGACCCGGAC
GCGGCGGTCG GAGCCGGCTA TGGCGACCGG TTCGACGACG CCGATGGGGA CGGCCCGCTC
AACGACCCCG ACTTCTGGGA CCCGCCCGCC CTGCGCCCGC CGGCCGGGCG TTCCGACGGC
GGCTCCCGAC GGCCAGCACG ACACACGGCG CGCAGGGGAC ACGCCCGATG A
 
Protein sequence
MTHPVRIPAD VDREDRIMAG LTARQVLILA LTAIVLYLAW AATRALLPLP VFALLAVPVA 
AGAGVLVLGQ RDGLSLDRML VAAIRQRTSP RHRINAPEGV IPPPSWLAAR ATSSSGDRRP
AAGGQSAAPL RLPARTVTTN AGVGVIDLGP DGLAVVAVAS TVNFALRTPG EQDGLVAVFA
RYLHSLTAPV QILVRAMPAD LTDQIRQLDD AADQLPHPAL AHAAREHATY LAQLADEMQL
LTRQVLLVLR EPLVAAGPVD GLGGASPLAA LSGRRAAARD ARRAGAAIRR AAHTRLARRL
AEATDLLAPA GIVVTPLDAG TATSVLAAAC NPAGLVPPAA LAAPDDVITA DVPEPVDSYS
AYQPDTDDGF LDDAGFDDPD AAVGAGYGDR FDDADGDGPL NDPDFWDPPA LRPPAGRSDG
GSRRPARHTA RRGHAR