Gene Franean1_5927 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5927 
Symbol 
ID5674248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7199208 
End bp7200356 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content70% 
IMG OID641244775 
Productcarboxylate-amine ligase 
Protein accessionYP_001510177 
Protein GI158317669 
COG category[S] Function unknown 
COG ID[COG2170] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02050] uncharacterized enzyme 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.777707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0696013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACATTC CCTTCTCGTC GTCGCCGAGT TCGAGCCTTG GGATCGAGTG GGAGCTGGAA 
CTCGTCGACC TGCAGAGCCG GCACCTGCGT GGCGGCGCGA CCGAGATCCT CGAGGACCTG
CGGGCCAAGG TCGGCGAGGA GGGCGCCGCC AAGGCCAAGC ACGAGCTCTT CGAGTCGACC
ATCGAAGTGA TCACCGGGGT GTGTCAGACC GTGCCCGAGG CCACCGCCGA CCTGCTCGGC
ACGGTCGAGG TGCTGCGGGA TCTGGCCGAG CGGCGCGGCA TCGGCCTGAT GTGCTCCGGC
ACCCACCCGA TCAGCGAGTA CTCCACCCAG AAGATCACCG CGGACGACCG CTACGACCGC
CTGGTCGGCC GGATGCAGTG GCTGGCCCGG CGGCTGCTCA TCTTCGGGGT GCACGTCCAC
GTGGGCGTGC GCTCGCCGGA GAAGGCGATG CCCATCGTGA ACGCGCTGAT GTCCTACATC
CCGCACTTCC TGGCGCTCTC GGCGTCCTCG CCCTACTGGC TCGGCTCGCA CACGGGGCTC
GCGTCGTCGC GCTCACGGGT GTTCGAGAGC CTGCCGACCG CCGGCCTGCC CTACCCGCTG
CACGACTGGG CGGCGTTCGA GGGGTTCATG GAGACCCTGG TGACGGCCGG CACCATCGAG
ACGATCCGCG AGGTGTGGTG GGACATCCGC CCCCACCCCA ACTTCGGCAC CGTCGAGCTG
CGCATCTGCG ACGGCCTGCC GACGCTGCTC GAGGTCGGCG CGGTCGCCGC GCTCGCGCAG
TGCATCGTCG ACCGGATGAA CACCCAGCTC GACCGCGGGT ACCGGCTTCC GGCCCCGCAA
CGCTGGGTGG TGCAGGAGAA CAAGTGGCGG GCCGCCCGCT ACGGGCTCGA CGCGGAGATC
ATGGTGGACG ACCGCGGCAC CGTGCGCCCG GTCAGCACCG ACATCGTCGA CCTCGTCGAG
GATCTCCTCC CCGTCGCCCG CCGGCTCGGC TGTGAAACCG AGCTCACCAA CGTCGACCGG
ATCCTCACCT CCGGGGCGAG CTACACCCGA CAGGAACTTG CCGCGCGGCG GGCCGGCGGA
GACCTCACCG CTGTGGTCGA TACGCTGCTT GCGGAGATGA ACGCGGGGCG ACCGGTCACC
CACGGGTAA
 
Protein sequence
MHIPFSSSPS SSLGIEWELE LVDLQSRHLR GGATEILEDL RAKVGEEGAA KAKHELFEST 
IEVITGVCQT VPEATADLLG TVEVLRDLAE RRGIGLMCSG THPISEYSTQ KITADDRYDR
LVGRMQWLAR RLLIFGVHVH VGVRSPEKAM PIVNALMSYI PHFLALSASS PYWLGSHTGL
ASSRSRVFES LPTAGLPYPL HDWAAFEGFM ETLVTAGTIE TIREVWWDIR PHPNFGTVEL
RICDGLPTLL EVGAVAALAQ CIVDRMNTQL DRGYRLPAPQ RWVVQENKWR AARYGLDAEI
MVDDRGTVRP VSTDIVDLVE DLLPVARRLG CETELTNVDR ILTSGASYTR QELAARRAGG
DLTAVVDTLL AEMNAGRPVT HG