Gene Franean1_0368 details

Gene Information

Locus tag: Franean1_0368
Symbol:
ID: 5668792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 437566
End bp: 438852
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 72%
IMG OID: 641239300
Product: von Willebrand factor type A
Protein accession: YP_001504740
Protein GI: 158312232
COG category:
COG ID:
TIGRFAM ID:
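
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive positions (an assumption that is consistent with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 437566
END_BP = 438852


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1287, matching "Gene Length"
print(protein_length(length_bp))  # 428, matching "Protein Length"
```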


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00177271
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000715031
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCGCGT TCACGGCGAA GGTGTACCAG AACGAGTTCC TGCCGGTCGG CGGGACCCAG 
GTACACGCGG TGATCACGGT GACCTCGACG GGCGCCCCGG CCGCCCCCCC GATCGCGGGC
CGGCCGACCG GGCGTCCGGA GCAGGCCCTG GTCATCCTGC TCGACTGCTC CGGCTCGATG
GCGAACCCGC CCGCGAAGGT CACCCAGGCC CGCCGCGCCG TCCGGGCGGC GCTCGACAGC
CTGCCCGACG GGGCGTGGTT CGCGGTGGTG CGCGGCACCG GCTCCGCCGC GATGGCGTAC
CCGCGCTCGC CCGAGCTGGT TCCGGCGTCC GCGGCGACGC GGGCGGCGGC CTGCCACGTG
GTGGACGCGC TCGAGCCGCA CGGCGGCACC GCCATGGGCC GCTGGCTGCG GCTGGCGAAC
GACCTGCTGG CGACCCGGCC CGACGCCATC GGCCACGCGC TGCTGCTCAC CGACGGGCAG
AACGGCGAGA TGGAATCCGA GCTGCTCGGC GCCGTCGACG CCTGCCAGGG CCGGTTCCAG
TGCGATTGTC GAGGGGTGGG CACCGACTGG CGGGTGGAGG AGCTGCGGGC GATCGCCACC
GGCATGCTGG GGACGGTGGA CGCCGTCCCC GAGCCCGCCG GCCTCGCGGC CGAGTTCGAG
CGGATCGTGG CCACCGCCCT CGACCGGGCC ACCGACCGGG TCTCGCTGCG GCTGTGGACG
CCCACCGGTG CCTCGCTGGA CTTCCTGCGC GAGGTCACCC CGGACCTGCG GGACCTCACC
GGCTCCGGCC GGGTCGTCGA CGACCACTGC ACGGACTACC AGACCGGGGC CTGGGGGATC
GAGTCCCGGG ATTACCACCT GTGCGTCCGC CTGCCGGCCC GCGAGGTCGG CACCGAGGTC
CTGGCGGCCC GCGTAAGCCT CGTCGTCGAC CACCAGCCGA CGTCATCCGC GCTGGTACGC
GCACTGTGGA CGGATGACAC GGCACTGGCC ACCCGGGTCA ACACGGAGGT TGCGCATTAC
ACCGGTCAGG CCGAGCTGGC CAGGGTCCTG GCGGACGGGC TGGAGGCCCG TCAACAGGGA
GACGACGTCA CTGCCACACT GAGGCTTGGG CGGGGAGTAC AGATCGCGCT CGCGTCAGGC
AACGAAGCAA CCTACCGCCT CCTACAGAAG GTGGTTCACA TCGACGATCC CACAACGGGT
ACGGTTCGTC TGAAGAAAAA CGTCGAGAAG ATGGACGAGA TGGTCCTCGA CTCGAGGTCA
ACCAGGACTG TCCGGGTGAA CAGGTGA
 
Protein sequence
MSAFTAKVYQ NEFLPVGGTQ VHAVITVTST GAPAAPPIAG RPTGRPEQAL VILLDCSGSM 
ANPPAKVTQA RRAVRAALDS LPDGAWFAVV RGTGSAAMAY PRSPELVPAS AATRAAACHV
VDALEPHGGT AMGRWLRLAN DLLATRPDAI GHALLLTDGQ NGEMESELLG AVDACQGRFQ
CDCRGVGTDW RVEELRAIAT GMLGTVDAVP EPAGLAAEFE RIVATALDRA TDRVSLRLWT
PTGASLDFLR EVTPDLRDLT GSGRVVDDHC TDYQTGAWGI ESRDYHLCVR LPAREVGTEV
LAARVSLVVD HQPTSSALVR ALWTDDTALA TRVNTEVAHY TGQAELARVL ADGLEARQQG
DDVTATLRLG RGVQIALASG NEATYRLLQK VVHIDDPTTG TVRLKKNVEK MDEMVLDSRS
TRTVRVNR
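
The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. The following is a minimal sketch, not the method the database used. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown above.
first_line = "ATGAGCGCGT TCACGGCGAA GGTGTACCAG AACGAGTTCC TGCCGGTCGG CGGGACCCAG"
print(translate(first_line))  # MSAFTAKVYQNEFLPVGGTQ
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence to `gc_content` and `translate` instead of the first line allows the reported GC content and protein sequence to be compared against the record.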