Gene Francci3_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2032 
Symbol 
ID3906749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2389395 
End bp2390633 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content73% 
IMG OID637879369 
Producthypothetical protein 
Protein accessionYP_481135 
Protein GI86740735 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.570232 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCT CCGATCCGGA CGCGCTCGCC GCCGCGAACA CTGTTGCCGA AGCACTAGCC 
GACCCGCACA CCGCCTGGGC CGCCCACCGA CCGACCGGGG GAAGGGCGTG GCCGCAGTCG
CTCGCCGGCG GCGCGGCGGG CATCGCCCTG CTGCACATCG AACGTGCCCG CTCCGGGTAC
GGCGACTGGA GCACCGTGCA CGCCTGGCTG TCCGCCGCCG CGTCCGACGC CCTGACCGCG
GCGGCCAACG CGGGCCTGTA CCTGGGCGCT CCGGCCCTGG CCTTCACCCT GCACACCGCA
GCCGGACCAT CAGGCCGGTA CCACCGTGCC CTCGCCCATC TGGATCAGGC CGTCGTCGCC
ATGACCCGCA CCCGACTCGC CGCGGCCCAC ACACGCATCG AACAGAGCCG GCGGCCCGCG
ATGAAGGAGT TCGACCTGAT CCGGGGCCTG ACCGGACTCG GCGTCTACCA CCTGCGCCGC
CACCCCGATC ACCCGATCAC CGGCGAACTG CTGTCGTACC TGGTCAGACT GACCGAACCA
CTGGCCGGAA GAGACGACCT CCCACCCTGG TGGACGGACT CTGCACCCAA CGGCGAACCC
AGCCCCGAAT TCCCCCAAGG ACACGGCAAC GTCGGCCTCG CGCACGGCAT CAGCGCCGTC
CTCGCCCTGC TTGCCCTGGC CCACCTGCGC GGCCTGCCGG TCCGTGGCGC CGACGACGCG
ATCGCACGGA TCTGCGCCTG GACCGACCGC TGGTGCCAGC ACGGCGACAC CGGCCCCTGG
TGGCCCGGAT TCATCACCCT CCGCCAGGTC CGTGAAGGCA AGGTCGCCGC AACCCTGCGG
CCCCGCCCCT CCTGGTGTTA CGGCGTCAGC GGCACCGCCC GCGCCCAACA GCTCGCCGGC
ATGGCCCTGC GCGACACCGC ACGCCAGCAA GCCGCCGAGA ACGCGCTCCT CGCGGCACTT
CGCGATGAGG CGCAACTCGA CCAGCTCACC GAGATCGGCC TGTGTCACGG CACCGCCGGG
CTACTCCAGT CCGCCTGGCG CATGGCGGCC GACTCCCACC ATCCCCAGCT CACCGCCGAA
CTCCCCGGCC TGTCAGCCAG GCTGATCGCA CAGATGGGCA CAACCGTGCG CGACCCCGAA
CTTCTCGACG GCGCCGCCGG CGCCGCCCTC GCCCTGCACA CCGCCGGCAC CGGCGCCGCC
CCGACGTCGG GCTGGGACGC CTTCCTCCTG CTGGCCTGA
 
Protein sequence
MTSSDPDALA AANTVAEALA DPHTAWAAHR PTGGRAWPQS LAGGAAGIAL LHIERARSGY 
GDWSTVHAWL SAAASDALTA AANAGLYLGA PALAFTLHTA AGPSGRYHRA LAHLDQAVVA
MTRTRLAAAH TRIEQSRRPA MKEFDLIRGL TGLGVYHLRR HPDHPITGEL LSYLVRLTEP
LAGRDDLPPW WTDSAPNGEP SPEFPQGHGN VGLAHGISAV LALLALAHLR GLPVRGADDA
IARICAWTDR WCQHGDTGPW WPGFITLRQV REGKVAATLR PRPSWCYGVS GTARAQQLAG
MALRDTARQQ AAENALLAAL RDEAQLDQLT EIGLCHGTAG LLQSAWRMAA DSHHPQLTAE
LPGLSARLIA QMGTTVRDPE LLDGAAGAAL ALHTAGTGAA PTSGWDAFLL LA