Gene Francci3_2199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2199 
Symbol 
ID3906338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2571187 
End bp2572446 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content66% 
IMG OID637879531 
Productepoxide hydrolase-like 
Protein accessionYP_481297 
Protein GI86740897 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.563968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCACG AAACCACGCC GTCGCACGCC GTCGAGACCC GGCCGTTCCC ACTGAAGCCG 
ACGCCGATTC ACGTGCCCGA CGACGTGCTC GCCGACCTCC AGCGGCGCCT GGAGTTGACT
CGCTGGCCGC TCGATGCGGG CAACGAGGAC TGGTATTACG GCGTGAACCG CGCCTACCTG
CAAGAACTTG TCGACTACTG GCGCACCGGC TACGACTGGC GCAAGTCCGA AGCCGCCATC
AACGCCTACG AGCACTACCA GGTCGAGGTC GAAGGCGTGC CGGTGCACTT CATGCGCAAG
GCCGGAGTCG GCCCCGATCC GACCCCCCTG ATCCTCACCC ATGGCTGGCC CTGGACGTTC
TGGCACTGGT CCAGGGTCAT CGATCCGCTG GCCGACCCCG GCGCGTACGG CGGCGATCCC
ACCGAAGCAT TCGATGTGAT CATCCCCTCG TTTCCCGGCT TCGGGTTCTC CGTACCGCTG
CCGAACAACC CGGACCTGAA CTTCTGGAAG GTCGCCGACC TCTGGCACAC CCTCATGACT
CAGACCCTCG GCTACGACAG GTACGCCGCC GCCGGCTGCG ACGTCGGAGC CCTGGTTACC
GGCCAGCTTG GGCACAAGTA CGCCGACGAG CTGTACGCCA TCCACATTGG CTCCGGCCTG
AAGCTCACCC TGTTCAACGG CGACCGGGCC TGGGACCTCA GCGGCGGCCG GCCCATCCCC
GACGGCCTTC CTGACGACAT CCACGCCCAG ATCGTCGCCG TGGAGAGGCG CTTCGCCGTC
CACCTCGCCG CGCACGTGCT CGCCCCGAGC ACGCTCGCCT ACGGGCTGTC CGACTCCCCG
GCCGGGATGC TCGCCTGGAT ACTCGAACGC TGGGTGAAGT GGAGCGACAA CGGCGGCGAC
ATCGAGACCG TCTTCACCAA AGACGACCTG CTCACCCATG CCATGATCTT CTGGGTGACC
AACGCGATCG GTACCTCGAT CCGCACCTAC GCCAACAACA ACCGCTACCC GTGGACCCCG
TCCCACGACC GGCAGCCAGC CATCGAGGCG CCCACCGGCA TCACCTTCGT CGGCTATGAA
AACCCACCCG GCGTCAGTAC CGACCAGCGG GTTCAGAACT TCCTCGACTC CGACCGCGCC
GCCTGGTACA ACCACGTCAA CCTCAACGCC CACGACCACG GCGGCCACTT CATTCCCTGG
GAAATCCCCG CTCAATGGGT CGACGACCTG CGGCGTACCT TCCGCGGCCG CCGCTACTGA
 
Protein sequence
MSHETTPSHA VETRPFPLKP TPIHVPDDVL ADLQRRLELT RWPLDAGNED WYYGVNRAYL 
QELVDYWRTG YDWRKSEAAI NAYEHYQVEV EGVPVHFMRK AGVGPDPTPL ILTHGWPWTF
WHWSRVIDPL ADPGAYGGDP TEAFDVIIPS FPGFGFSVPL PNNPDLNFWK VADLWHTLMT
QTLGYDRYAA AGCDVGALVT GQLGHKYADE LYAIHIGSGL KLTLFNGDRA WDLSGGRPIP
DGLPDDIHAQ IVAVERRFAV HLAAHVLAPS TLAYGLSDSP AGMLAWILER WVKWSDNGGD
IETVFTKDDL LTHAMIFWVT NAIGTSIRTY ANNNRYPWTP SHDRQPAIEA PTGITFVGYE
NPPGVSTDQR VQNFLDSDRA AWYNHVNLNA HDHGGHFIPW EIPAQWVDDL RRTFRGRRY