Gene Franean1_1073 details

Gene Information

Locus tag: Franean1_1073
Symbol:
ID: 5669487
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1269206
End bp: 1270945
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 74%
IMG OID: 641240002
Product: hypothetical protein
Protein accession: YP_001505435
Protein GI: 158312927
COG category:
COG ID:
TIGRFAM ID:
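
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(1269206, 1270945)
protein = protein_length_aa(length)
print(length, protein)  # 1740 579
```

Both results match the reported gene length (1740 bp) and protein length (579 aa).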


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.542041
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGCGG AAGAACTCAT GGTGGCTGGG CCGCCCCTGA CCCTGTCGGG CCCGATCCGG 
CCGGGCCGCC CGTCGGCCCG TATCTCCCCG GACCACACCT CCCCGGAGAT CATCCCGCCG
GACCGCATCT CGACAGCGCA GTCCGCCGCC GACATCCCGG GCACGCACAT CCCGGCCACC
CAGCTCCCGA ACAACCAGGT CCCGGCGCCG TCGATGGCGT CCACCGGGCA CGCCAGGCCG
ACGGCGAAGG CTGCCGCGGG CAAGGCCGCC GGCGGAACGT CCACCGGCGC GGGTGGCGCT
GGCGGCGCCA GCAGCACCCG CGCAGACGGC ACCAGGGGTG CCGGTACCAG CCCCAACACC
GGCGAATCCG GCACGCCGAC CCTGTTCTCC CGCCTGCTCG CCGAGGCCAA CGTCTCCGAC
ACCCGCTTCG CACGTCAGGT CAACAACCGC GCCCGATCCC AGCGCCGCAT CGAGCTCGGC
CTAGCCCGGA CGACCGTCGG GCACTGGCGC CGCGGCATGC GGCCCCGGGA CCCGATGGTC
GCCGAGCTGG CCGCGGCCGA GCTCTCCGCC CTCGTCGGCT ACCCGGTGAC CCCCGCCGAC
CTCAGCTGGC GGGGCGAAGC CAGTGAACGC GACGACCTCG GCCTCGCCGT CGCGGACATC
CCCGACGACA CCCTGCGGAC GCTCGCTGGA CTTTCGGGAC GAGACATGCG GCGACGTGAC
GTCCTACATG ACGGGGCGGC CTTCGTCGCC ACCGCCTTCG CCGACCCCGT GCTGTCCAGC
CTCACCGGCA TGATCCGCCG GATCAGCGCG GACGTCCCGT CCTCCCCGTC CGGCGGAGCG
ATGATCCGGG ACATGACCGA GACGTTCCGC CGCCTGGACG CCCGGTTCGG CAGCAGTGAG
ATCCGCCCCC AGGTCGTGAC GTTCCTGCAC GACCGGACGC GGGCGGCTGT GGCGGGACCG
GCCGACACCG ACACCTTCGG CGCCCTCGCC GAGCTGGCCC AGTTCAGCGG CTGGCTCGCC
CAGGACTGCA ACCGCCAGGC GCTGGCCCAG CGCTACTACA TCCAGGCACT CACGCTGGCC
GAGCACGCCG ACGACGTCAT GATGGCCGGC CGGGTGCTGT CAGCGATGAG CGACCAGTCC
GCGGCCCTGG GGCACAACCG GCACAGTCTG TCCCTGGCGC GCGCGGCGAT CGACCGGTCC
GCCCGGCAGT CCGCGCCGGC CGTGCAGGCG ATGCTGCAGG ACAAGCTGGC GTGGGCCCTC
GCCCGCAACG GCGACGAGGC CGGCTGCATG CGTGCCCTGG ACGCGCTGGA GCGCACGATC
TCCCGCGAGC CCGGCGACGC CCCGTCCTGG GCCGGGCACT ACAACATCGG CGACGTGGCC
GAGTGTCAGG GCCACTGCCT CCTCCTGCTG GGCCGGGCGG AGATGGCCGA GAAGCGGCTG
TTGGAGGCAC GTGACCTGCA GGGTCCGGCG CGGGCCCGGA CCCGCGCGTA CGCGGAGGCG
GACCTGGCGC TGTCCTACCT GAAACGCCCG CGCCCCGAGC TCGAGGCGGC CCTCGAAGCC
GGGTACCGGG CGGTGGAGGT GGCCGGCCCG GTGTCCTCCA CCCGGATCGT CAACAAACTC
TCCGAGCTGG ACCGGACGAT CGCCGGCTTC TCGAAAGCCG TCGCGGCCCG TGAGTGGCGC
TCACGCGCCG CTGGTCTCGT GCGACCTTCC CCCCAGCGGC CGGAACCCGC CGTCGGCTGA
 
Protein sequence
MVAEELMVAG PPLTLSGPIR PGRPSARISP DHTSPEIIPP DRISTAQSAA DIPGTHIPAT 
QLPNNQVPAP SMASTGHARP TAKAAAGKAA GGTSTGAGGA GGASSTRADG TRGAGTSPNT
GESGTPTLFS RLLAEANVSD TRFARQVNNR ARSQRRIELG LARTTVGHWR RGMRPRDPMV
AELAAAELSA LVGYPVTPAD LSWRGEASER DDLGLAVADI PDDTLRTLAG LSGRDMRRRD
VLHDGAAFVA TAFADPVLSS LTGMIRRISA DVPSSPSGGA MIRDMTETFR RLDARFGSSE
IRPQVVTFLH DRTRAAVAGP ADTDTFGALA ELAQFSGWLA QDCNRQALAQ RYYIQALTLA
EHADDVMMAG RVLSAMSDQS AALGHNRHSL SLARAAIDRS ARQSAPAVQA MLQDKLAWAL
ARNGDEAGCM RALDALERTI SREPGDAPSW AGHYNIGDVA ECQGHCLLLL GRAEMAEKRL
LEARDLQGPA RARTRAYAEA DLALSYLKRP RPELEAALEA GYRAVEVAGP VSSTRIVNKL
SELDRTIAGF SKAVAAREWR SRAAGLVRPS PQRPEPAVG
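
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The following is a minimal sketch, assuming the gene sequence has been pasted into a string, of how the GC fraction could be recomputed for comparison with the reported GC content. The helper names are hypothetical, and the short input in the usage lines is a toy example rather than the gene itself.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input in the same block layout as the record (not the real gene).
example = "ATGC GGCC"
print(clean_sequence(example))  # ATGCGGCC
print(gc_fraction(example))     # 0.75
```

Applied to the full gene sequence, the result can be compared with the 74% GC content listed in the gene information.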