Gene Franean1_3696 details

Gene Information

Locus tag: Franean1_3696
Symbol:
ID: 5672062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4375242
End bp: 4377068
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 68%
IMG OID: 641242579
Product: hypothetical protein
Protein accession: YP_001507999
Protein GI: 158315491
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2072] Predicted flavoprotein involved in K+ transport
TIGRFAM ID:
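
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4375242
END_BP = 4377068
GENE_LENGTH_BP = 1827
PROTEIN_LENGTH_AA = 608


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # prints 1827
    print(expected_protein_length(GENE_LENGTH_BP))  # prints 608
```

Under those assumptions both derived values agree with the reported Gene Length and Protein Length.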


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.761481
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAACT GTGCCCCGAC CGAGACGCCC GAGGTCGACC ATGACGCGCT CCGGACGAAG 
TATCTGGGAG AGCGGGACAA GCGCCTGCGC CGTGAGGGCC AGAAGCAGTA CCTGGTGACC
GAGGGCGACT TCGAGGAGTT CTACGAGGCC GACCCGTACA CACCGGTCGT GCCCCGGACG
CCGATAGTCG AGGACATCGA CGTGGCGATC TTAGGCGGCG GCTGGGCGGG CCTCGTGGCC
GCGGCGCGGC TCAAGCAGGA GGGCGTCACC AACTTCCGGA TCATCGAGCT GGCGGGCGAC
TTCGGCGGGG TGTGGTACTG GAACCGGTAT CCCGGCATCC AGTGCGACAC CGACGCCTAC
TGCTACCTTC CGCTGCTCGA GGAGGTCGGT TACATCCCGA AGGAGAAGTA CGCCCGGGGC
GACGAGTGCC TGGAGCACGC CCAGCGCATC GGGAAGCACT TCGGCCTGTA CGAGCACGGG
GTGTTCAGCA CGCTGGTCCG CTCGCAGGAG TGGGACGAGC GGACGAATCG CTGGAAGATC
ACCACGAACC GCGGCGACGA GATCAGCGCC CGGTTCGTGG TGATGTGCCA GGGGCCCTTC
AACCGGCCGA AGCTGCCGGG CATCCCCGGC ATCGCCGACT ACCAGGGCCA CACGATCCAC
ACCGCCCGGT GGGACTACGA GTACTCCGGC GGCGACCTGC ACGGCGGGCT GGACAAGCTC
GCCGGCAAGA AGATCGCGGT GATCGGCACC GGGGCCAGCG GCGTGCAGGT CATCCCGCAC
ATCGCGCAGA CCGCCGAGCA CCTGTACGTG TTCCAGCGCA CCGCCTCCTC GATCGACGAG
CGCGGCGACG TCCCGACCGA CCCGGAGTGG GCCGCGACGC TGCAGCCGGG CTGGCAGGAG
GAGCGGCGGC GCAACTTCCA CGCCGCGGCC TACGAGGCGT TCGCGCCGGG ACAGCCGGAC
CTGATCTGCG ACGGCTGGAC CGAGATCAGC CGTAACCTGC AGGCGCAGCT GGACGCCACC
AACGGCTGGG CGGCGCTGGC GGACCCGGCG AAGTTCATGG AGCTGCGCGA GCTCGTCGAC
TACCAGGTGA TGGAGCGGCT GCGACAGCGG ATCGACGCGA TCGTCGAGGA CCCGGTGACC
GCCGAGGCGC TCAAGCCGTA CTACCGGTTC CTGTGTAAGC GGCCGTGCTT CAACGACCTC
TACCTGCCGA CCTTCAACCG CCCCAACGTC ACCCTGATCG ACGTGGCGGA GACCAAGGGC
GTCGAGCGGA TCACCGCGAA GGGCATCGTC GCGCACGGCG TCGAGTACGA GGTCGACTGC
ATCGTGTTCG CCAGCGGTTT CGAGATCACC AGTGACCTCG ACCGCCGGCT CGCGATCCAC
CCGTACGCCG GCCGCGAGGG CCGGTCGCTC TACGAGCACT GGGTCGACGG CTACCGCACC
CTGCACGGGA TCATGACCCA CGGCTTCCCG AACCAGTTCT TCACCGGCTA CATCCAGGGC
GGCGTCACGG CGGCCGTGCC GGCGATGTTC GAGCAGCAGG CCGTGCACAT CGCCCACATC
ATCTCCGAGA CCCTCGGCCG TGGCGCGGTG ACCGTCGAAC CCACCTTCGA CGCGCAGGAG
AGGTGGGTCG CGGCGGTCCG GGCCGCGGAG ATCGACCAGA CCAACTTCGC GCGGGAGTGC
ACGCCCGGCT ACTACAACAA CGAGGGCGAG CAGCGGATCC GGTCGGTCCT CGGCGACCCC
TACTGGGCCG GCTTCTACGC ATTCGGCGAG CTGCTGCAGG AGTGGCGCGA CTCAGGCGAT
ATGGCGGGGC TCGTCCTCGG CAAATGA
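
The GC content field can in principle be recomputed from the gene sequence. This is a minimal sketch with a hypothetical helper, `gc_percent`, which assumes the sequence contains only A, C, G and T and ignores the whitespace used to group bases into blocks of ten. The example call uses only the first line of the sequence, so its result is not expected to equal the 68% reported for the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # First line of the gene sequence only; paste all lines for the full value.
    first_line = (
        "ATGCAGAACT GTGCCCCGAC CGAGACGCCC GAGGTCGACC ATGACGCGCT CCGGACGAAG"
    )
    print(round(gc_percent(first_line), 1))
```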
 
Protein sequence
MQNCAPTETP EVDHDALRTK YLGERDKRLR REGQKQYLVT EGDFEEFYEA DPYTPVVPRT 
PIVEDIDVAI LGGGWAGLVA AARLKQEGVT NFRIIELAGD FGGVWYWNRY PGIQCDTDAY
CYLPLLEEVG YIPKEKYARG DECLEHAQRI GKHFGLYEHG VFSTLVRSQE WDERTNRWKI
TTNRGDEISA RFVVMCQGPF NRPKLPGIPG IADYQGHTIH TARWDYEYSG GDLHGGLDKL
AGKKIAVIGT GASGVQVIPH IAQTAEHLYV FQRTASSIDE RGDVPTDPEW AATLQPGWQE
ERRRNFHAAA YEAFAPGQPD LICDGWTEIS RNLQAQLDAT NGWAALADPA KFMELRELVD
YQVMERLRQR IDAIVEDPVT AEALKPYYRF LCKRPCFNDL YLPTFNRPNV TLIDVAETKG
VERITAKGIV AHGVEYEVDC IVFASGFEIT SDLDRRLAIH PYAGREGRSL YEHWVDGYRT
LHGIMTHGFP NQFFTGYIQG GVTAAVPAMF EQQAVHIAHI ISETLGRGAV TVEPTFDAQE
RWVAAVRAAE IDQTNFAREC TPGYYNNEGE QRIRSVLGDP YWAGFYAFGE LLQEWRDSGD
MAGLVLGK
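
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is an illustration, not the database's own pipeline: it builds the standard codon assignments, which table 11 shares with the standard code (the two differ in the permitted start codons), and translates up to the first stop codon. The `translate` function is hypothetical; it does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-of-ten spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_line = (
        "ATGCAGAACT GTGCCCCGAC CGAGACGCCC GAGGTCGACC ATGACGCGCT CCGGACGAAG"
    )
    print(translate(first_line))  # prints MQNCAPTETPEVDHDALRTK
```

The output for the first line of the gene sequence matches the first twenty residues of the protein sequence above.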