Gene Franean1_0529 details

Gene Information

Locus tag: Franean1_0529
Symbol:
ID: 5668947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 615599
End bp: 618124
Gene Length: 2526 bp
Protein Length: 841 aa
Translation table: 11
GC content: 76%
IMG OID: 641239457
Product: hypothetical protein
Protein accession: YP_001504895
Protein GI: 158312387
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4591] ABC-type transport system, involved in lipoprotein release, permease component
TIGRFAM ID:
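
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(615599, 618124)
print(length)                     # 2526
print(protein_length_aa(length))  # 841
```

Both results agree with the Gene Length (2526 bp) and Protein Length (841 aa) fields of this record.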


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.628545
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGGC TGGTCTGGCA CACCGTCCGC GCCCGCAAGG GCAGCCTGGT GGGCACATTC 
GTCGCGCTGA CGCTCGGCGT GGCGCTGCTG GCCGCGATGG CGCTGACCCT CGTGAGCAGC
GTCGGGGGCG GCGGGGGCCG CCCGACCTGG TACGTCGATG CCGACGTGGT CGTGGCGGGT
GGCGGCATTG TCAGTGTCAC CACCGGTTCA GGCGAGGACC GGGAGACGGC CTCGCTGCGT
ACCGCCACCG CGCGCGGGCT GCCGAGCGGC CTGCGCGACC GTCTGGCCGG CCTGGACGCT
TCCCTGGTGC TGGACTTCGC GGCGTACGCG ACGGCGCCCG GCGCGCCCGG TGACACCGTC
CGCCCGTGGT CGGCCGCCGC CCTGCACGCC TACGGGTGGG TCGCGGGCGG GCCGCCCCGT
GGTGCTCGGG ACGTCGTGCT CAGCGCCCCG ACTGCCCACC GTCCCGGAGC CGAGATCACC
GTCGTCACCG GGCGCGGCGT GGAGCGCTTC GTGGTCAGCG GTGTGCTGCG CACCGACGCC
CCGGCGGCGC TGTACACCGC CGACGCCGTT GCCGAGGAGC TCGCCGACGG CCGCGTCGCG
GCGGTCGCGC TGAACGCGCC GGGCGCCGGC CCAGCCGGCG CCAGCACGCC CGCCGTCATG
TCGTTGGCCG ACGCCGCCCG CGCCGTCGTC GGCGACACCG CGCTCGAGGA CGGCTCCGTT
CAGGTGCTCA CCGGCGACGA CCGGCGCCGA GCCGAGCCGG ACCCCGACGC CGAGCGGCGG
ACGGAGGCGG TCGCCCTGCT CGCCGCCACC ACGGGGCTCG CCGGCTTCGT GTCGATCTTC
GTGGTCTCCG GGACGTTCGC CTACGCCGTG ACGGCCCGGC GCCGTGAGTT CGGCCTGTTG
CGGGCCGCCG GCGCCACGCC CCGCCAGGTC TTTCGGATCG TGCTCGGCGA GGCCCTCACC
GTCGGTGTGC TCGCCTCGCT GGCCGGCGGT GCCCTGGGAG CCGCGATCGC CCCGGACTTC
GCCGCGCGCC TGGCCCGCAC CGGGTTCGTG CCCAGCGACT TCACCGCCCG GTTCGTCTTC
TGGCCGGTGG CCGCCGCGTT CGGGACCGGG CTGGGCGTGG CCCTGCTCGG TGCCTGGGTG
GCGGCGCGGC GAGCGGGGCG GGTGCGCCCG GTCGAGGCGC TGCGCGACGC GGCGGTGGAC
CGTCGGCCGA TGACGACGGC GCGCGCGCTG GCCGGCCTGC TCGCTCTCGG CTGCTGCGTG
CCGCTGGTCG CGGTGCTGGT GGCGAACCCG AGCGCGGACG CCGTCGCGCT CATCATGATC
ACGGCGCTGT TCCTGATCGT CGCCTGCGCG ATGTTCGCTC CGCTCGTGGT GCCGCCGCTG
GTGTCGCTGC TCAGTGCCCC GCTGTCGGGG TCGTCGGGGG CGGTGGGCCT GCTCGCCGGC
CATGGCGCGC GGGCGGCCGT ACTGCGCACC GCGGCGACCG CGGCGCCGAT CCTGGTGACC
GTCGGGATTG CCGGCTCCAC CCTGACCGGG CTCGGCACCC TGCAGGCGGC GACGCAGAAC
GCGGCACGGG AGCGGATCAC GGCCGAGGCA CTCACCATGC CCGTCACCGG GAAGGGGCTG
CCCGACGCGA GCGTCGCCGC CCTGCGCGAG GTCCCCGGGG TGAGTGCCGC CGTTCCCGTC
ACCGAGAGCC GGGTCTACGT TCGGGACGGC GACGAGCCGG AGGGCTGGAC CGGCTACTAC
GCGTCGGGCG CGGACCTCGC CGCCGTACTC GACGTCCCCC TGGTCGCGGG ATCGCTGGCC
GACCTCGCCG GCACCGACAC GGTCGCCGTT CCGGAGGGTC GCTGGGAGCT CGGCGAGACG
GCCGAGCTGT GGCTCGGTGA CTCGACGCCG GCACGGCTGC GGGTGGTGGC CGTCTTCGAG
AGGCAGCTCG ACCTCTCGGA GACCGTCCTG CTGCCGTCGC GGCTGCGCGA CCGCCACGCC
CTCCCGGGGG CCGACGTCGT CTACCTGCGG CTGGCACCGG ACGCGTCGTT GGAACAGGTA
CGCGCGGCTG CCGCCGCCGG TGCCGGAACG GTGGTCGACA CCGGGAGCTA CCTCTCGGCG
GCGGGCGAGG AGGAGGCACG GGTGAACCGG CAGGCGTCGG TCGCCATGCT CGGCCTGTCG
CTCGTCTACA CCGGCATCGC GATCGCCAAC ACCCTCGTGA TGGCGACCCG GGACCGCGCA
CGGGAGTTCG CGACCATCCG GCTCGCCGGT GCCACCCGCC GTCAGGTGCT GTGGGTGGTC
GGCACCGAGG CGGTGCTGGT GACCTGCATC GGGGTGCTGC TGGCCGCGGT CGTCACGGCG
GTCACGGCAC TCGGTGCCCG CCACGGCCTG GCCGACATCG CGCCGTCCGT GCCGCTGGCC
GTGCCCTGGG CTCCGCTCGC CGGGATCGTC CTGGCCTGCC TGGTCACGGC CGTGCTGGCC
AGCGTGATCC CGGCCGCGCT GCTGCTGCGT CGCCGCCCCG CCGAGCTGGC CGGCGTCCGC
GAGTAG
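
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, assuming GC content is simply the fraction of G and C bases over all bases. The record does not state how the database derives or rounds its value, and the function name is hypothetical.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases in a nucleotide string, ignoring whitespace."""
    # Drop the spaces and line breaks used to lay the sequence out in blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Small illustrative input (not taken from the record): 6 of 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence, pasted in as a single string, gives the fraction for this gene; multiply by 100 to compare it with the percentage in the record.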
 
Protein sequence
MIRLVWHTVR ARKGSLVGTF VALTLGVALL AAMALTLVSS VGGGGGRPTW YVDADVVVAG 
GGIVSVTTGS GEDRETASLR TATARGLPSG LRDRLAGLDA SLVLDFAAYA TAPGAPGDTV
RPWSAAALHA YGWVAGGPPR GARDVVLSAP TAHRPGAEIT VVTGRGVERF VVSGVLRTDA
PAALYTADAV AEELADGRVA AVALNAPGAG PAGASTPAVM SLADAARAVV GDTALEDGSV
QVLTGDDRRR AEPDPDAERR TEAVALLAAT TGLAGFVSIF VVSGTFAYAV TARRREFGLL
RAAGATPRQV FRIVLGEALT VGVLASLAGG ALGAAIAPDF AARLARTGFV PSDFTARFVF
WPVAAAFGTG LGVALLGAWV AARRAGRVRP VEALRDAAVD RRPMTTARAL AGLLALGCCV
PLVAVLVANP SADAVALIMI TALFLIVACA MFAPLVVPPL VSLLSAPLSG SSGAVGLLAG
HGARAAVLRT AATAAPILVT VGIAGSTLTG LGTLQAATQN AARERITAEA LTMPVTGKGL
PDASVAALRE VPGVSAAVPV TESRVYVRDG DEPEGWTGYY ASGADLAAVL DVPLVAGSLA
DLAGTDTVAV PEGRWELGET AELWLGDSTP ARLRVVAVFE RQLDLSETVL LPSRLRDRHA
LPGADVVYLR LAPDASLEQV RAAAAAGAGT VVDTGSYLSA AGEEEARVNR QASVAMLGLS
LVYTGIAIAN TLVMATRDRA REFATIRLAG ATRRQVLWVV GTEAVLVTCI GVLLAAVVTA
VTALGARHGL ADIAPSVPLA VPWAPLAGIV LACLVTAVLA SVIPAALLLR RRPAELAGVR
E