Gene Franean1_1340 details

Gene Information

Locus tag: Franean1_1340
Symbol:
ID: 5669751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1612841
End bp: 1614385
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 73%
IMG OID: 641240271
Product: hypothetical protein
Protein accession: YP_001505698
Protein GI: 158313190
COG category:
COG ID:
TIGRFAM ID:
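
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be sketched in a few lines of Python. This is an illustrative check only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Protein length for an unspliced CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the final codon is the stop codon


length = gene_length_bp(1612841, 1614385)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values from this record, the two results agree with the listed gene length (1545 bp) and protein length (514 aa).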


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGCC CGCCGGTGTA CCGGCTTACC CGCCGCCGGT CCGGCGCGAC CCACTTCGGT 
CTCGCCGGGG CACCGCTGGT GCTGGTCGCA GGGGGTTTCG CCGCGCTGGT TCTGCTTCCG
CTGCTGACCG ACAGCGTAGC GATCGGAGTC GTCGTGGCCG CTGCCTGCGG GCTGGCCGCG
TTCCTGCCGG TACCGGGCGG AGGGCCCGTC TACCAGGCTG TACCGCTTGC CATGCGGCAT
CTGGCCCGTC GGGTCGGAAG GCGGCACCAG TGGACCGCTT CGCTGCCCCT GCTCGCCGGC
GCTCCTGCCA CGGCGGAGAG CACGAGGGGC GAGGCCCGAC GGCTCCTACC GCCGCCGCTA
CGAGGTCTGG AGATCCTCAC CGTCCCCCGT GCTTCCGCGA CGGGCGCGAC GCGCAGCCTC
GCTCCCATCG CCCTGATCCA TGACCGGCGC GCCGGCACGC TCACCGCGGT GCTGTCCGCA
CGCGGCAGTG AGTTCGGGCT GCTCGAGCCT GCTGACCAGC ATCACCGGCT GTCCGCCTGG
GCGCAGGTCC TCTCCAACAC TGCCCGCGAC AGCGGCGTGG TGCGGCTGGG CTGGTCGCTG
TGGTCCGCTC CTGTCTCGCC GGCCGACCAT GTGCACTGGC TGAAGGACCG CCACCCTGAC
ACGGCCACGG CGGGCGTCGG GCACACCCGG GCGGCTGAGG ACTACCAGAC GCTGCTCGAG
AACGCCGCGG CGACGCTGAC CCGCCATGAC TTACGGCTGT GGCTGTCACT CGACACAGGC
CGGCTGCCCC GCCGCGCCGA CCCGACTGAC GCCGCGGCGC AGGCCGCGCT CACCCTGGCC
GAACGGTGCC GCGCCGCCGG CCTGGTCGTG GATGATCCCG ATTCGCCGGT GGGCGTCGCC
GAGGCGCTGC GTCTGCGCGC CGACCCGTCC GTGGCCGCCA CCCTCTCCAG AGTGCAGCGC
ACGCTCGCAC AGCAGATGGG CACCGCCAGC GTTATAGACG GCGTGCACGC CGGGCCGTTG
TCGATGCACG CGCAATGGGA CGCCGTGCGC ATCGACGACG TATGGCACCG GGTCTTCTGG
GTGTCACAGT GGCCGACGGC CGCCCTGCAC CCGGGCTGGC TCGACCCCCT GCTGTTCGAC
GTCTCCTGCG TCCGCACGGT GGCGCTCCTG CTGGAGCCGG TGTCCGCACG TGCCTCCCGC
CGGCGGATCA ACTCCGACGC CGTCGAGGTG GAGAGCCGGA TGGCGGTGCG GGAACGGCAC
GGTTTTCGGG TTCCGACCCA CCTGGCAGGG GCGCAGCAGC AGGTCGACGA ACGGGAAGCC
GAACTGCACG CCGGCCACGC CGAATACGGC TACCTCGCAC TCGTCGACAT CGCCGCGCCG
ACCCGCGGCG ACCTCGACGA CGCCAGCCGC CAGCTCGTCG ACGTGGCCGC GTTCGCCGGC
ATCAACGAAA TCCGCCCGCT ACACGGCCGC CACGACCTGG CCTGGGCCGC GACCCTACCC
ACCGGCAGAG CACCCGGCCG CGGGCTTCTC GGTGGATCCC CATGA
 
Protein sequence
MIRPPVYRLT RRRSGATHFG LAGAPLVLVA GGFAALVLLP LLTDSVAIGV VVAAACGLAA 
FLPVPGGGPV YQAVPLAMRH LARRVGRRHQ WTASLPLLAG APATAESTRG EARRLLPPPL
RGLEILTVPR ASATGATRSL APIALIHDRR AGTLTAVLSA RGSEFGLLEP ADQHHRLSAW
AQVLSNTARD SGVVRLGWSL WSAPVSPADH VHWLKDRHPD TATAGVGHTR AAEDYQTLLE
NAAATLTRHD LRLWLSLDTG RLPRRADPTD AAAQAALTLA ERCRAAGLVV DDPDSPVGVA
EALRLRADPS VAATLSRVQR TLAQQMGTAS VIDGVHAGPL SMHAQWDAVR IDDVWHRVFW
VSQWPTAALH PGWLDPLLFD VSCVRTVALL LEPVSARASR RRINSDAVEV ESRMAVRERH
GFRVPTHLAG AQQQVDEREA ELHAGHAEYG YLALVDIAAP TRGDLDDASR QLVDVAAFAG
INEIRPLHGR HDLAWAATLP TGRAPGRGLL GGSP
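
The gene and protein sequences can be cross-checked by translating the nucleotide sequence codon by codon. The sketch below is a minimal illustration, not the database's own method: it uses the standard codon-to-amino-acid assignments, which translation table 11 is assumed to share (table 11 differs from the standard table in its permitted start codons). The `translate` helper is hypothetical; the two sequence fragments are the first 30 and last 15 bases of the gene sequence above, with the grouping spaces removed.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame, stopping at the first stop codon."""
    dna = dna.replace(" ", "").replace("\n", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


head = translate("ATGATCCGCC CGCCGGTGTA CCGGCTTACC")  # first 30 bases
tail = translate("GGTGGATCCC CATGA")                  # last 15 bases
print(head, tail)
```

The translated fragments match the start (MIRPPVYRLT) and end (GGSP) of the protein sequence listed above.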