Gene Franean1_7319 details

Gene Information

Locus tag: Franean1_7319
Symbol:
ID: 5675620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8946847
End bp: 8948673
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 73%
IMG OID: 641246156
Product: von Willebrand factor type A
Protein accession: YP_001511544
Protein GI: 158319036
COG category:
COG ID:
TIGRFAM ID:
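
The start and end coordinates, gene length, and protein length listed above are consistent with one another. The short Python sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start/end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(8946847, 8948673)
print(length)                  # 1827
print(protein_length(length))  # 608
```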


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.505027
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGGTG CCCGCCACCG CCACCGCCGT TCGGGCCCCT CGTTACGGGG GGTCGCGGCG 
GTGGCTGCGG TACCCCTCAT GGTCGGCGCC CTGACCTGCG GATGGCTCGT CCTGCGCGGA
GGGGCCGGCC CCGTTCGCTG CGATCGCACG ATCACCCTCG GGGTCACCAC CTCGCCGAGC
CTGGCCACGG CGTTGAGCGA GGCCGCCGCG GCCTACGGCA GCGGGAAGCC GACGGTGTCC
GGGTACTGCG TGTCCGTCCG GGTCGACACG GCGGGCGGCG GCCAGGTCGC CTCCTACATG
AGAGGCGGCT GGACGGACCC GACGGCCGGT CCGATCCCGG ACGTCTGGGT GCCGGACTCC
ACGGACTGGC TCACGCTGGC GCGGACCACC GAGCCGGCCA ACCGGCTGCT GGTGGACACC
GGCACCGTTA TCGCCACCTC ACCGGTCGTC ATCGCGATGC CCCGCCCGAT GGCCGAGGTG
TTCGGCTGGC CGCGTCGCGA GCTCTCCTGG GCCGACCTGC GCAAGCTCGG TGGCGACGAG
GGCTACTGGG GGTCACGGGG ACGGCCGGCC TGGGGCGGTT TCACCGTCGG GCTGCCCGAC
CCCCGGGTGT CGGCTGCCGG GATGACCGCG CTGGCCGACG CCGTGGCCGC CGCGCTGAAG
ACCCCGGTCG AACGGCTCAC CGAGGACATG TTCACCGACG GCCTCGCGGC CAAGGGTGCC
CTCCTGGATC TGGAGCGCTC GTCGGCGCTG GTAGCCGCCT CCGACACCGA CCTGCTCACG
GCCGTCCGCG CGGCGGACCT CGAGGACCCC GCGGCGACCA GGCTCACCGC GTTCCCGCTT
CAGGAGAGTC TCGTCTACCA GTACAACCGC CGGGTGGGCA TCGGCGCCGC GCTGCCGGAC
GGCCGCGGAC CGGAGCTGGC CGCCTTCTAC CCGCGGGACG GCACCGAGCT CGACGAGATC
CGGTACACCG TGCTGAGCCG GGCGTCGGAC GACCCGGTGA AGGCCGAGGT GGCGCGGGAC
TTCCTGCGGA CGCTGACGTC CGGGCCGGGG CGGGTCGCCC TGCTCGGGAA CGGCCTGCGC
CCCCCGGACG GCATCGCCGA CTCGTTCACA GCCCGGACGG GGCTCACCCC GCGGCCGCGG
ATGACGCCCG AACGCACCCT GGACGCGACG GTGCTGACCG CGCTCCAGGG GAGCTTCGCA
GGCGTCCATC AGCGCGGGAA CACCCTCGCC GTGCTGGACA CCTCGGGCTC CATGAACGAG
GAGGTGCCGG GCAGCGCGGG CCGCAGCCGG CTGTCGGTGG CGCTCGACGC GGCGAAGTCC
GCCATCCCGC TGTTCGCCGA AGACAGCGAT CTCGGGCTGT GGCAGTTCTC CACCCGGCTG
CGCGGCGACC AGGACTGGGA GGAGCTCGTG CCGCTCGGGC CGATGGGCGA GCGGCTGGGC
GCCGGCACGC GCTCGCAGGC GGTGATGGAC GCGGTGAACC GGATCGAGCC GCGCGGCGAC
ACCGGGCTCT ACGACACGGC CCTCGCCGCG TTCCGCTACA TGAACCAGCA CTATGTGCCG
GGCCGGCCCA ACCAGGTCGT GCTGCTGACC GACGGGAAGA ACTCCGATCC CGGCAGCATC
GCGCTCGACG AGCTGGTGCG GATCCTGCGC CGGGAGTACT CGCCCCAGCG GCCCGTCCAG
GTGATCACGA TCGGCTATGG CGCCGACACG GATCTCGCCG CGCTGTCGCG GATCTCGGCC
GCGACCGGAG CCGAGACGTA TCCCGCGCTG GACCCGAACA CCATCTTCGA GGTCCTCGTC
GACGCGCTGA CCGAGGTTCC CGGCTGA
 
Protein sequence
MPGARHRHRR SGPSLRGVAA VAAVPLMVGA LTCGWLVLRG GAGPVRCDRT ITLGVTTSPS 
LATALSEAAA AYGSGKPTVS GYCVSVRVDT AGGGQVASYM RGGWTDPTAG PIPDVWVPDS
TDWLTLARTT EPANRLLVDT GTVIATSPVV IAMPRPMAEV FGWPRRELSW ADLRKLGGDE
GYWGSRGRPA WGGFTVGLPD PRVSAAGMTA LADAVAAALK TPVERLTEDM FTDGLAAKGA
LLDLERSSAL VAASDTDLLT AVRAADLEDP AATRLTAFPL QESLVYQYNR RVGIGAALPD
GRGPELAAFY PRDGTELDEI RYTVLSRASD DPVKAEVARD FLRTLTSGPG RVALLGNGLR
PPDGIADSFT ARTGLTPRPR MTPERTLDAT VLTALQGSFA GVHQRGNTLA VLDTSGSMNE
EVPGSAGRSR LSVALDAAKS AIPLFAEDSD LGLWQFSTRL RGDQDWEELV PLGPMGERLG
AGTRSQAVMD AVNRIEPRGD TGLYDTALAA FRYMNQHYVP GRPNQVVLLT DGKNSDPGSI
ALDELVRILR REYSPQRPVQ VITIGYGADT DLAALSRISA ATGAETYPAL DPNTIFEVLV
DALTEVPG
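
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any downstream use. The following is a minimal Python sketch of that cleanup together with a GC-percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the inputs in the example are short toy strings rather than the full gene.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input: four G/C bases out of eight.
print(gc_percent("GGCC AATT"))

# The final line of the gene sequence, with block spacing removed.
last_line = clean_sequence("GACGCGCTGA CCGAGGTTCC CGGCTGA")
print(len(last_line), last_line[-3:])
```

Applying `gc_percent` to the full gene sequence, pasted in as a single string, allows a comparison against the GC content reported in the Gene Information section.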