Gene Franean1_6734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6734 
Symbol 
ID5675047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8189335 
End bp8190492 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content70% 
IMG OID641245583 
ProductATPase 
Protein accessionYP_001510974 
Protein GI158318466 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0140161 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0815898 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCC CTGCCCCGAC CCCCTCCGCC CCCGCCGCCT TTCGCCTAGC ACCAGGCGGA 
CTGCACGAAA TGGTCGCCCG CTACCTGCTC GATCACCCTT CCGCATCCCA CACACCGACC
GCCGTGGCGA AGGCGTTGAC CCGTAGCAAC GGCGCCGTGG GCAACGCGCT GCACCGTCTG
GCCGAGGCCG GGCAGGCGAC GTTGACCAGC ACCAAACCCC GCCGGTACAC GGCCACCCCC
ACCACGCGGG ACGCATTCGG ACGGACCGGC ATACCGCCCG TGGCAAGGCC CCGCCCGGCC
CCGGCACCGC CCCCGGCGGC GGTGCCGAAG CCCCGGCCGG CCCTTCCGCC CACGACGCCG
GATGGGGGGA TCATCCGCCC GTCGGGGCAG GTGTACCGGC CCCGGAAGCT GGCGGACCTG
CCCGACGTGG AAGTCCTGCG CAAGCTACGC ACGGCCGAAG TACCGGTCCT GCTCTACGGC
CCGCCCGGTA CGGGGAAGAC GAGCGTGATC GAGGCCGCCT TCGGCGACGA TCTGATCACG
ATCGCGGGGG ATGGTGACAC CCAGGTCGGT GACCTGATCG GTGAGTACAC CCAGACCCCC
GACGGCCGGT ACGAGTTCGT CTACGGGCCG CTCATCACCG CCATGCAGGA GGGGAAGGTC
CTCCTCGTCG ACGACGCCAC TCTGATCAGC CCGGCGGTGC TCGCGGTGAT GTACCCGGCG
ATGGACGGCC GGAAAAGGAT CATCGTGAAG GCGCACAAGG GCGAGGCGGT AGAAGCAGCC
CCCGGCTTCT ACGTGATCGC TGGACACAAC CCGGGTGTCC ACGGGGCGAT CTTGAGCGAA
GCGTTGTCAT CCCGTTTCGC GGTGCAGGTC GAAGTGTCGA CCGACTTCGA TCTCGCCACC
AAACTCAAGA TCGACAGCAG GGCGGTGCGG GTCGCGCGGA ACCTCGCGCG GCGCCGCGAG
TCCGGGGAGA TCGGCTGGTC CCCGCAGCTG CGAGAGCTGA TCGCCTTCCA GAAGATCGCG
GACGTGTTGG GTGTCCCGGC GGCGGCAGCG AACCTGATGG GGATCGCGCC GGCCGAGGAC
CGGCCGGTGG TCGCGGACAC GGTCGAGAAG GTCTTCGGGA TCAAGCTCGC GCCCCTCGCC
CTCGGCAAGC AGATCTAA
 
Protein sequence
MSSPAPTPSA PAAFRLAPGG LHEMVARYLL DHPSASHTPT AVAKALTRSN GAVGNALHRL 
AEAGQATLTS TKPRRYTATP TTRDAFGRTG IPPVARPRPA PAPPPAAVPK PRPALPPTTP
DGGIIRPSGQ VYRPRKLADL PDVEVLRKLR TAEVPVLLYG PPGTGKTSVI EAAFGDDLIT
IAGDGDTQVG DLIGEYTQTP DGRYEFVYGP LITAMQEGKV LLVDDATLIS PAVLAVMYPA
MDGRKRIIVK AHKGEAVEAA PGFYVIAGHN PGVHGAILSE ALSSRFAVQV EVSTDFDLAT
KLKIDSRAVR VARNLARRRE SGEIGWSPQL RELIAFQKIA DVLGVPAAAA NLMGIAPAED
RPVVADTVEK VFGIKLAPLA LGKQI