Gene Franean1_2450 details


Gene Information

Locus tag: Franean1_2450
Symbol:
ID: 5670846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2911614
End bp: 2912954
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 78%
IMG OID: 641241367
Product: hypothetical protein
Protein accession: YP_001506788
Protein GI: 158314280
COG category:
COG ID:
TIGRFAM ID:
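
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon; neither is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2911614
END_BP = 2912954
GENE_LENGTH_BP = 1341
PROTEIN_LENGTH_AA = 446


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # True
```

Under those assumptions, 2912954 - 2911614 + 1 gives 1341 bp, and 1341 / 3 - 1 gives 446 aa, matching the Gene Length and Protein Length fields.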


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.454806
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCACG GTCAGACGGC CCCCGTCGCC GTCCAGGCAT CCCCGACCCA GGTGCCCCGC 
CAGGCCCCCG GCCGGACCGG TCCCCGCAGT CCGCACACCG GCCGGCCCCG GTCCGGCGAG
GATGCCCGCC GCTCCGGCAC CGGCGGCCCC CGCACCGGCC AGCCGCTCCC CGGCGCCGAC
ACGGCCGTCG GCCAGTGGCC GGGAACCACC GGCTCGGCGC CGGGCACACC GCCGCGCCGC
CCCGGCTCCG TCCGGCGGAC GACCTCGATC ATGATGGAGC GCCCGGACGG CCTCACCGGC
CCGCTGCACC TCACCGGTGT CGGCCGTGAC CTGCTGACCA CCGCCGCGGG CGAGGTGGAG
GTCCTCGGCG AGGCGCGGCT GCGCGTCGTC CTCGACTACA TGGCCGCGCC CGGCCCCGTC
CTCGAGATCG AGTCCGAGCC CGCCGTTCCC GGCCTCGCGA GCCTGGTCGG CGCCTCGGCG
CGGGCCGGCT TCCGCGGCGC CGCCCGGCAG ATCCTCGCGG CCGCCGACGC CGCCGGCGCA
GAAGGCGACG CGGCCGACAG CAACGGCGAC GGAGCTGGTG GGGCGTCCAC CACCCGGCGC
TCGCTGCTCG AGCAGCTCCT CGACGACGTG CCCGTGGCGA CCCTGGTGTC GGGCTCCGCG
CTCGGGCGCA ACGGCATCTT CCGCGACGAC CCCTCGCGGC ACGCCGGCGC GGAGGCCGGC
GGCAACCCGA TGCTCGACGT CTGCGCCGGC TGGCAGCGGG AGGGGCTGCT CGCCCGCGTC
TCCGCCGAGC GCAGCGCCGA CCCCGAGGCC CCGGTGCGGA CGACCATGGT CGTGGCCGGC
GACCTCGCGG TCGACGAGGA CCCGCTCGGG TGGCATCCGC TGCCGCCGAT GGGCACCCAT
GCCATGCGCC GCCTGCGCCG CATCGACCTC GTCCCGATCG GCGGCGAGCC CGCCGGCACC
GGACCCGCCC GTACCGGTGC CGCGGACGGG GCCATCGACG GGGCCGCCGC CTTCCGGGTC
GACGCCTTCT TCCGTGACAC CTACCGGGAG GTGTCCGGCG CCGAGGTCGT CGTGCACGAG
TACGGGATCG ACGCCACGGT CGACCAGTCG GCCCGCGTCG TGCGCTCGGT GGGGCGGGCC
GGGGTCCTGC CGGGCCCCGA GTGCCCCCAG GCACTGGCCA GCGCCGAGCG CATCGTGGGG
CTCGGCGCCG CGGACCTGCG CCGCACCGTC TCGCGCACCT TCACAGGGAC GACCACGTGC
ACGCACCTCA ACGACACCCT GCGCTCCCTG GGCGATCTCC CCGACCTGAT CGAGCGGCTC
CGCTCGGCCA CCGCCCGCTG A
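
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be cleaned before it can be measured. The following is a minimal sketch of how the GC content field could be recomputed from such a listing. The helper names are hypothetical, and the fragment used in the demonstration is only the first line of the sequence, so its result should not be expected to equal the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) DNA sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence only; paste the full listing to
    # compare against the GC content field of the record.
    fragment = (
        "ATGGCGCACG GTCAGACGGC CCCCGTCGCC "
        "GTCCAGGCAT CCCCGACCCA GGTGCCCCGC"
    )
    print(f"{gc_fraction(fragment):.0%}")
```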
 
Protein sequence
MAHGQTAPVA VQASPTQVPR QAPGRTGPRS PHTGRPRSGE DARRSGTGGP RTGQPLPGAD 
TAVGQWPGTT GSAPGTPPRR PGSVRRTTSI MMERPDGLTG PLHLTGVGRD LLTTAAGEVE
VLGEARLRVV LDYMAAPGPV LEIESEPAVP GLASLVGASA RAGFRGAARQ ILAAADAAGA
EGDAADSNGD GAGGASTTRR SLLEQLLDDV PVATLVSGSA LGRNGIFRDD PSRHAGAEAG
GNPMLDVCAG WQREGLLARV SAERSADPEA PVRTTMVVAG DLAVDEDPLG WHPLPPMGTH
AMRRLRRIDL VPIGGEPAGT GPARTGAADG AIDGAAAFRV DAFFRDTYRE VSGAEVVVHE
YGIDATVDQS ARVVRSVGRA GVLPGPECPQ ALASAERIVG LGAADLRRTV SRTFTGTTTC
THLNDTLRSL GDLPDLIERL RSATAR
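
The protein sequence can be related back to the gene sequence by codon translation. The sketch below builds the codon table from the conventional 64-letter amino-acid string in TCAG base order; translation table 11 uses the same codon assignments as the standard code and differs in which codons may act as start codons, which does not matter here because this gene begins with ATG. The `translate` function is an illustrative helper, not part of any database tooling, and it assumes an unambiguous A/C/G/T sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # Start of the gene sequence: matches the first residues of the protein.
    print(translate("ATGGCGCACG GTCAGACGGC"))    # MAHGQT
    # End of the gene sequence, including the TGA stop codon.
    print(translate("CGCTCGGCCA CCGCCCGCTG A"))  # RSATAR
```

Passing the full gene sequence to `translate` would allow the whole protein listing to be compared residue by residue.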