Gene Franean1_2103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2103 
Symbol 
ID5670503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2527264 
End bp2529702 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content77% 
IMG OID641241024 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_001506445 
Protein GI158313937 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.858083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGCCG TCGCGATGGA GACCGGGCTT GGCCTGCCGA GCGGCTCCGA TGCGGCGAGT 
GGCGGTCGGC TGGACCTCCG CCTGGTCGGT CCTGCCGTCG CGGTCTGGGG CGGGGCGGCC
GCGGCGGGGT ACTGGGGCTT CGGGTCGGTG CTTCTCGGCG CCGCCGGCAT GGTGCTGTGT
GCGTGCCCGG TGGGGGTGCT GATCGCGCTG CATCAGGGTC CCGGCGGCCG GCTCGGCCGG
CCGGGAGCGC TGGTGGCGTT CGTCGCCCTG GCGTTCCTCG CGGCCGGGGT TCTCGTGGGC
GGGGCGGCCG CGCGGCCCCG GTTCACCGGC CCGCTCGCCG AGATGGCCCG GGCCCACCGG
ACCGTCAACG CCGAGGTCGT CCTGAGCGAC GACCCGAAGA TCTCGGCGGC GGCGGCGCCG
GCTGCCACTT CGGGCCCGCG GGAGGGGCCG ACGGTCACCG CCCGGGCTCG ACTCGAGGCC
GTGACCGGAC CCGGCTGCCG GCTGCGGGCC TCTGTGCCCG TGCTCCTGGT GGGGCGCGGC
TCGGATCTCG CGAACTACCT GCCGGGGCAG CGGCTCGGCG TCCGCGCCGG GCTGGCGCCG
GCGGGTCCGG GCGACACGAT CGCGACGGTT CTGTTCGTCC GGGCACCGCC ACGGGCACAG
GGGCGCCCGG ACGCCGTGCA ACGGGCCGCC GGGTGGCTGC GGGCCGGCCT ACGGGACGCC
GCCGACGTGG TGCCCCAGCC CGCCGGCGGG CTGCTGCCGG CACTGGTCGT CGGTGACACC
TCGGGCCTTG ATCCCGGGCT GAAGGACGAC TTTCGCACCG CCGGGATGAG CCATCTGACC
GCGGTGTCCG GCGCCAATCT GGCGATTACC GCGGGGACGG TGCTGTTCCT GCTCGGGCGG
CTCCGGCTCG GGGCCCGCTC CAGGGCGGTC GCCGCGGCCC TGGTCCTCGT CGGGTTCGTG
ATCCTGGCCC GGCCGTCCGC CAGCGTGGTC CGGGCCGGAG CCATGGGACT GGTCGGCCTG
GTCGCGCTCG CCGCGGGTCG GCCCCGCGCG GTGCTGGCGG CGCTGGCCAC CGCGGTCATC
GCCGTCGTGA TGGCGGATCC GGCGTTCGCG CTGTCGGCCG GGTTCGCCCT GTCGGTGCTC
GCGACGACCG GGATGATCGT GTGGGGGCCC GGCTGGAGTG ACGTGCTGGA GCGCGGCCGC
GCGGCCGGCC GGCTCGGCGA GGTCGTGGCG GTGGCCGCCG CCGCCCAGCT TGCCTGTACG
CCGGTGCTGG CCTGGCTCGG TGGGGGCATC AGCATTGTCG CGATCCCGGC CAACGTGGTG
GCGGCGCCGG CGGTCGCGCC CGCCACCGTG CTCGGCGTGT CGGCGATGGC CGTCGCCGCC
GTCAACGACC CGGTCGCCGC GCTGCTCGCG CGGCTTGCGG GGCTGGCCTG CCACTGGCTC
GTGCTGGTCG CGGACGTGGC CGCCGGTGTC CCCGCGGCCA CGATCGGCTG GCCGGCGGGG
CTCGCGGGGG CGGCCACCGC GCTTGGCTGC GTCGTACTGG TGGTGGCTCT GGCGCGCCGG
CGTCCGACCC GCTGGCTGCT GGTGTCGGCG GTGGTGGGCC TGCTGGCGGC CCGGGTGCTC
CTGCTGCCGA GGCTGGCGGG CTGGCCGCCG CCCGGCTGGC GGCTCGTTGC CTGCGATGTG
GGCCAGGGCG ACGGCCTGGT CCTGCGCGCC GGGCCGGCCT CGGCGGTCGT GGTCGACGTG
GGTCCCGACC CGGCGCTGAT CGCCGCCTGC CTGGATGACC TCGGAGTACG CGAGGTGCCG
CTGCTCATGC TCACCCATCT GCACGCGGAT CACGCCGCCG GCCTCAGCGG GGTGGTGGGC
CGGCTGCCGG TGGGGGAACT GGTGGTGAGC CCGCTGCCCG AGCCGGCCGA CCAGTGGGAC
GCGGTCGAGC GCGCGGCGCG GGCGGCGGGC GTGCCGGTGC GGGCCGTCAC CGCCGGCGCC
GCGGGGGAGA CCGGAGCGGT CCGCTGGCGG GTGATCGGCC CGGAACGGGT CCTGCGGGGC
ACGGCCAGTG ATCCGAACAA CGCCAGTCTC GTGGTCCTGG CGGCGGTCGG CGGCGTGACG
ATACTGCTCA CCGGGGATGC CGAGCCGCCG GAGCAGCGGC AGGTGGCGCG GCGTGGTCTC
GGGCCCGTCG ACGTGCTCAA GGTGGCTCAC CACGGCTCCG AGGACCAGCT GCCGGAGTTC
CTCACCCGGA CCGGCGCCGA GGTGGCGCTG ATCAGCGTCG GCGTCGACAA CACCTACGGG
CATCCCGCGC TGAGCACGCT GGCCGGCCTG CGCGCGGCCG GGATGGCGGT GGCCCGCACC
GACCTGCACG GCACGGTGGC CGTCGTCGAG ACGGCCGGCG GTGGCGTCCG GGCGGTGGCT
CGCCGGCCCG GCCCGCGAGG GGGCGGGGCC GGCTCGTGA
 
Protein sequence
MVAVAMETGL GLPSGSDAAS GGRLDLRLVG PAVAVWGGAA AAGYWGFGSV LLGAAGMVLC 
ACPVGVLIAL HQGPGGRLGR PGALVAFVAL AFLAAGVLVG GAAARPRFTG PLAEMARAHR
TVNAEVVLSD DPKISAAAAP AATSGPREGP TVTARARLEA VTGPGCRLRA SVPVLLVGRG
SDLANYLPGQ RLGVRAGLAP AGPGDTIATV LFVRAPPRAQ GRPDAVQRAA GWLRAGLRDA
ADVVPQPAGG LLPALVVGDT SGLDPGLKDD FRTAGMSHLT AVSGANLAIT AGTVLFLLGR
LRLGARSRAV AAALVLVGFV ILARPSASVV RAGAMGLVGL VALAAGRPRA VLAALATAVI
AVVMADPAFA LSAGFALSVL ATTGMIVWGP GWSDVLERGR AAGRLGEVVA VAAAAQLACT
PVLAWLGGGI SIVAIPANVV AAPAVAPATV LGVSAMAVAA VNDPVAALLA RLAGLACHWL
VLVADVAAGV PAATIGWPAG LAGAATALGC VVLVVALARR RPTRWLLVSA VVGLLAARVL
LLPRLAGWPP PGWRLVACDV GQGDGLVLRA GPASAVVVDV GPDPALIAAC LDDLGVREVP
LLMLTHLHAD HAAGLSGVVG RLPVGELVVS PLPEPADQWD AVERAARAAG VPVRAVTAGA
AGETGAVRWR VIGPERVLRG TASDPNNASL VVLAAVGGVT ILLTGDAEPP EQRQVARRGL
GPVDVLKVAH HGSEDQLPEF LTRTGAEVAL ISVGVDNTYG HPALSTLAGL RAAGMAVART
DLHGTVAVVE TAGGGVRAVA RRPGPRGGGA GS