Gene Francci3_3094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3094 
Symbol 
ID3904220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3665415 
End bp3666629 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content74% 
IMG OID637880415 
Productarsenite-transporting ATPase 
Protein accessionYP_482180 
Protein GI86741780 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.968228 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGTGTG TGCTCTTCAC GGGGAAGGGC GGCGGCGGTA CCACCACGGT GGCGGCGGCG 
ACCGCGATCC TCGCCGCCCA ACGGGGTCAC CGGACCCTCG TGCTGTCGGT CGACCCGGCC
GCGGGGCTCG CCGGCGCGCT CGACCACCCG ATCGGCGCCG AGCCGACCGA GCTTGAACCG
GGGTTGCACG GCCAGCAGGT CGACCTGCGC CGCGCGGTCG AGACCCGGTG GCCGGCCGTA
CGCGAGGTGC TGGCGGGCAC CTGGCCGGCC ATCAACGTCG ATCCGTTCGA CCTGGAGGAG
TTGGCCTTCC TGCCGGGGGC CGTCGAGACC CTGACGCTGC TCGAACTGCG CGACGGTCTC
ACCAGCGAGA ACTACGACCT GGTGGTGGTC GACGGAGGCC CGGCGGCGGC GCTGGTGCGG
CTGCTGGCCT TTCCCGAGAC GCTGTCGTGG TACTGCCGCC GGCTGCTGCC GCCCGACGGT
GCCTTCGCGC GCTGGCTGCG ACCCGGTTTC GGCTGGGCGG CGGCGCTCGG CGGGCGGTGG
AGCGCGCTGG CGGCCCCCGC CTACGACACC GTCTCCCGCC TGCACCGGGC GGCGGTCGAC
CTGCGCGCGA TCCTCACCGA CCACGCCACC ACCAGCGTCC GGCTGGTGGC GACCCCCGAG
AGCTCGGCGC TCGCCGCGGC CCGCGGCAGC TTTACGGCGC TGTCGCTGCA CGGGTTCACT
CTGGACGGCG TCGTCGTCAA CCGGATCTTT CCGCCGGCGA ACGCGGACGC CTGGCGGGCG
GGGTGGGCCG CCGTTCACCG CGAGCAGCTC GCCGACATCA CCGCGGCCTT CATGCCCACC
CCGGTCCTCC CGGTCGGCTA CCGGGCGGGC GAACCGATCG GCCTGGAGGA GCTGGCCGCT
TTCGGCGCGG CGACCTACGG TGAGCTCGAT CCGGGGAGTG TTCTCGGGGA GCCGCTCATC
GGCCCGAGCG GCCAGCCGCG GGTGGAACGC ACCGAGGACG GGTTCGCGCT GTCCTTCGGC
CTGCCGTTCG TCGACAGTTC CCAGATCGAT CTGGCCCGGC TGGGTGATGA CCTGGTGGTG
ACCGTGGGGT CGTACCGGCG GGCTGTTCCG CTGCCCGCGG CGTTGCGTCG CTGCGACGTG
AGCACCGCCC GGCTGCGTGA CGACCGGCTG GTCGTCTCGT TCGTTCCGGA CCCGCGGCAG
TGGGTGCGGG TATGA
 
Protein sequence
MRCVLFTGKG GGGTTTVAAA TAILAAQRGH RTLVLSVDPA AGLAGALDHP IGAEPTELEP 
GLHGQQVDLR RAVETRWPAV REVLAGTWPA INVDPFDLEE LAFLPGAVET LTLLELRDGL
TSENYDLVVV DGGPAAALVR LLAFPETLSW YCRRLLPPDG AFARWLRPGF GWAAALGGRW
SALAAPAYDT VSRLHRAAVD LRAILTDHAT TSVRLVATPE SSALAAARGS FTALSLHGFT
LDGVVVNRIF PPANADAWRA GWAAVHREQL ADITAAFMPT PVLPVGYRAG EPIGLEELAA
FGAATYGELD PGSVLGEPLI GPSGQPRVER TEDGFALSFG LPFVDSSQID LARLGDDLVV
TVGSYRRAVP LPAALRRCDV STARLRDDRL VVSFVPDPRQ WVRV